Overview
Brought to you by YData
Dataset statistics
| Number of variables | 138 |
|---|---|
| Number of observations | 988402 |
| Missing cells | 69735969 |
| Missing cells (%) | 51.1% |
| Total size in memory | 1.0 GiB |
| Average record size in memory | 1.1 KiB |
Variable types
| Text | 138 |
|---|
Dataset
| Description | US NMNH Extant Specimen Records 0052487-241126133413365 |
|---|---|
| URL | https://doi.org/10.15468/dl.wttrju |
license has constant value "CC0_1_0" | Constant |
publisher has constant value "National Museum of Natural History, Smithsonian Institution" | Constant |
institutionID has constant value "urn:lsid:biocol.org:col:15463" | Constant |
collectionID has constant value "urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8" | Constant |
institutionCode has constant value "US" | Constant |
collectionCode has constant value "US" | Constant |
datasetName has constant value "NMNH Extant Biology" | Constant |
occurrenceStatus has constant value "PRESENT" | Constant |
verbatimSRS has constant value "1938-11-11" | Constant |
footprintSRS has constant value "315" | Constant |
footprintSpatialFit has constant value "315" | Constant |
georeferencedBy has constant value "1938" | Constant |
georeferencedDate has constant value "11" | Constant |
georeferenceSources has constant value "11 Nov 1938" | Constant |
latestEpochOrHighestSeries has constant value "South America - Neotropics, Colombia, Meta" | Constant |
earliestAgeOrLowestStage has constant value "SOUTH_AMERICA" | Constant |
lowestBiostratigraphicZone has constant value "7296210" | Constant |
lithostratigraphicTerms has constant value "CO" | Constant |
group has constant value "Meta" | Constant |
dateIdentified has constant value "Plantae, Dicotyledonae, Malpighiales, Violaceae, Violoideae" | Constant |
identificationReferences has constant value "Plantae" | Constant |
identificationVerificationStatus has constant value "Tracheophyta" | Constant |
identificationRemarks has constant value "Magnoliopsida" | Constant |
taxonID has constant value "Malpighiales" | Constant |
namePublishedInID has constant value "Rinorea" | Constant |
taxonConceptID has constant value "Rinorea" | Constant |
parentNameUsage has constant value "pubiflora" | Constant |
originalNameUsage has constant value "pubiflora" | Constant |
namePublishedIn has constant value "VARIETY" | Constant |
superfamily has constant value "821cc27a-e3bb-4bc5-ac34-89ada245069d" | Constant |
subfamily has constant value "2024-12-02T13:57:09.776Z" | Constant |
tribe has constant value "450.0" | Constant |
subtribe has constant value "50.0" | Constant |
infragenericEpithet has constant value "OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT" | Constant |
cultivarEpithet has constant value "false" | Constant |
verbatimTaxonRank has constant value "7296210" | Constant |
nomenclaturalCode has constant value "7707728" | Constant |
nomenclaturalStatus has constant value "1414" | Constant |
taxonRemarks has constant value "6631" | Constant |
publishingCountry has constant value "US" | Constant |
subgenusKey has constant value "Magnoliopsida" | Constant |
protocol has constant value "EML" | Constant |
projectId has constant value "edulis" | Constant |
catalogNumber has 132504 (13.4%) missing values | Missing |
recordedBy has 11879 (1.2%) missing values | Missing |
lifeStage has 916836 (92.8%) missing values | Missing |
preparations has 959242 (97.0%) missing values | Missing |
associatedSequences has 988328 (> 99.9%) missing values | Missing |
occurrenceRemarks has 968411 (98.0%) missing values | Missing |
fieldNumber has 988343 (> 99.9%) missing values | Missing |
eventDate has 119809 (12.1%) missing values | Missing |
startDayOfYear has 261666 (26.5%) missing values | Missing |
endDayOfYear has 261666 (26.5%) missing values | Missing |
year has 122319 (12.4%) missing values | Missing |
month has 181983 (18.4%) missing values | Missing |
day has 314697 (31.8%) missing values | Missing |
verbatimEventDate has 655426 (66.3%) missing values | Missing |
habitat has 877971 (88.8%) missing values | Missing |
locationID has 979422 (99.1%) missing values | Missing |
continent has 32788 (3.3%) missing values | Missing |
waterBody has 984227 (99.6%) missing values | Missing |
islandGroup has 963568 (97.5%) missing values | Missing |
island has 906001 (91.7%) missing values | Missing |
countryCode has 10855 (1.1%) missing values | Missing |
stateProvince has 219376 (22.2%) missing values | Missing |
county has 826754 (83.6%) missing values | Missing |
locality has 72708 (7.4%) missing values | Missing |
verbatimDepth has 983702 (99.5%) missing values | Missing |
decimalLatitude has 841005 (85.1%) missing values | Missing |
decimalLongitude has 841005 (85.1%) missing values | Missing |
coordinateUncertaintyInMeters has 987002 (99.9%) missing values | Missing |
verbatimCoordinateSystem has 980404 (99.2%) missing values | Missing |
verbatimSRS has 988401 (> 99.9%) missing values | Missing |
footprintSRS has 988401 (> 99.9%) missing values | Missing |
footprintSpatialFit has 988401 (> 99.9%) missing values | Missing |
georeferencedBy has 988401 (> 99.9%) missing values | Missing |
georeferencedDate has 988401 (> 99.9%) missing values | Missing |
georeferenceProtocol has 960543 (97.2%) missing values | Missing |
georeferenceSources has 988401 (> 99.9%) missing values | Missing |
georeferenceRemarks has 988289 (> 99.9%) missing values | Missing |
latestEpochOrHighestSeries has 988401 (> 99.9%) missing values | Missing |
earliestAgeOrLowestStage has 988401 (> 99.9%) missing values | Missing |
lowestBiostratigraphicZone has 988401 (> 99.9%) missing values | Missing |
lithostratigraphicTerms has 988401 (> 99.9%) missing values | Missing |
group has 988401 (> 99.9%) missing values | Missing |
bed has 988400 (> 99.9%) missing values | Missing |
identificationQualifier has 985985 (99.8%) missing values | Missing |
typeStatus has 967033 (97.8%) missing values | Missing |
identifiedBy has 866335 (87.7%) missing values | Missing |
dateIdentified has 988401 (> 99.9%) missing values | Missing |
identificationReferences has 988401 (> 99.9%) missing values | Missing |
identificationVerificationStatus has 988401 (> 99.9%) missing values | Missing |
identificationRemarks has 988401 (> 99.9%) missing values | Missing |
taxonID has 988401 (> 99.9%) missing values | Missing |
namePublishedInID has 988401 (> 99.9%) missing values | Missing |
taxonConceptID has 988401 (> 99.9%) missing values | Missing |
parentNameUsage has 988401 (> 99.9%) missing values | Missing |
originalNameUsage has 988401 (> 99.9%) missing values | Missing |
namePublishedIn has 988401 (> 99.9%) missing values | Missing |
order has 10135 (1.0%) missing values | Missing |
superfamily has 988401 (> 99.9%) missing values | Missing |
family has 10432 (1.1%) missing values | Missing |
subfamily has 988401 (> 99.9%) missing values | Missing |
tribe has 988401 (> 99.9%) missing values | Missing |
subtribe has 988401 (> 99.9%) missing values | Missing |
genus has 15345 (1.6%) missing values | Missing |
genericName has 15400 (1.6%) missing values | Missing |
infragenericEpithet has 988401 (> 99.9%) missing values | Missing |
specificEpithet has 75483 (7.6%) missing values | Missing |
infraspecificEpithet has 923675 (93.5%) missing values | Missing |
cultivarEpithet has 988401 (> 99.9%) missing values | Missing |
verbatimTaxonRank has 988401 (> 99.9%) missing values | Missing |
vernacularName has 988400 (> 99.9%) missing values | Missing |
nomenclaturalCode has 988401 (> 99.9%) missing values | Missing |
nomenclaturalStatus has 988401 (> 99.9%) missing values | Missing |
taxonRemarks has 988401 (> 99.9%) missing values | Missing |
elevation has 625728 (63.3%) missing values | Missing |
elevationAccuracy has 880635 (89.1%) missing values | Missing |
depth has 979722 (99.1%) missing values | Missing |
depthAccuracy has 980482 (99.2%) missing values | Missing |
distanceFromCentroidInMeters has 987807 (99.9%) missing values | Missing |
mediaType has 69371 (7.0%) missing values | Missing |
orderKey has 10134 (1.0%) missing values | Missing |
familyKey has 10432 (1.1%) missing values | Missing |
genusKey has 15344 (1.6%) missing values | Missing |
subgenusKey has 988401 (> 99.9%) missing values | Missing |
speciesKey has 75442 (7.6%) missing values | Missing |
species has 75443 (7.6%) missing values | Missing |
projectId has 988401 (> 99.9%) missing values | Missing |
gbifRegion has 19586 (2.0%) missing values | Missing |
level0Gid has 854767 (86.5%) missing values | Missing |
level0Name has 854767 (86.5%) missing values | Missing |
level1Gid has 855021 (86.5%) missing values | Missing |
level1Name has 855020 (86.5%) missing values | Missing |
level2Gid has 859029 (86.9%) missing values | Missing |
level2Name has 859040 (86.9%) missing values | Missing |
level3Gid has 953538 (96.5%) missing values | Missing |
level3Name has 953860 (96.5%) missing values | Missing |
iucnRedListCategory has 91545 (9.3%) missing values | Missing |
gbifID has unique values | Unique |
occurrenceID has unique values | Unique |
Reproduction
| Analysis started | 2025-01-08 22:48:51.675757 |
|---|---|
| Analysis finished | 2025-01-08 22:49:46.176351 |
| Duration | 54.5 seconds |
| Software version | ydata-profiling vv4.12.1 |
| Download configuration | config.json |
Variables
gbifID
Text
Unique 
| Distinct | 988402 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 988402 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1320179379 |
|---|---|
| 2nd row | 1675994101 |
| 3rd row | 2592240144 |
| 4th row | 2571494932 |
| 5th row | 3357270605 |
| Value | Count | Frequency (%) |
| 1320179379 | 1 | < 0.1% |
| 1320208262 | 1 | < 0.1% |
| 1320183762 | 1 | < 0.1% |
| 1321737296 | 1 | < 0.1% |
| 1320181414 | 1 | < 0.1% |
| 2592240144 | 1 | < 0.1% |
| 2571494932 | 1 | < 0.1% |
| 3357270605 | 1 | < 0.1% |
| 1321730091 | 1 | < 0.1% |
| 1320180447 | 1 | < 0.1% |
| Other values (988392) | 988392 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1442218 | |
| 2 | 1379595 | |
| 3 | 1292646 | |
| 5 | 938230 | |
| 6 | 853796 | |
| 4 | 850434 | |
| 7 | 817947 | |
| 8 | 805418 | |
| 0 | 782405 | |
| 9 | 721331 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9884020 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1442218 | |
| 2 | 1379595 | |
| 3 | 1292646 | |
| 5 | 938230 | |
| 6 | 853796 | |
| 4 | 850434 | |
| 7 | 817947 | |
| 8 | 805418 | |
| 0 | 782405 | |
| 9 | 721331 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 9884020 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1442218 | |
| 2 | 1379595 | |
| 3 | 1292646 | |
| 5 | 938230 | |
| 6 | 853796 | |
| 4 | 850434 | |
| 7 | 817947 | |
| 8 | 805418 | |
| 0 | 782405 | |
| 9 | 721331 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9884020 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1442218 | |
| 2 | 1379595 | |
| 3 | 1292646 | |
| 5 | 938230 | |
| 6 | 853796 | |
| 4 | 850434 | |
| 7 | 817947 | |
| 8 | 805418 | |
| 0 | 782405 | |
| 9 | 721331 |
license
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | CC0_1_0 |
|---|---|
| 2nd row | CC0_1_0 |
| 3rd row | CC0_1_0 |
| 4th row | CC0_1_0 |
| 5th row | CC0_1_0 |
| Value | Count | Frequency (%) |
| cc0_1_0 | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1976804 | |
| 0 | 1976804 | |
| _ | 1976804 | |
| 1 | 988402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2965206 | |
| Uppercase Letter | 1976804 | |
| Connector Punctuation | 1976804 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 1976804 | |
| 1 | 988402 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1976804 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1976804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4942010 | |
| Latin | 1976804 | 28.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 1976804 | |
| _ | 1976804 | |
| 1 | 988402 |
Latin
| Value | Count | Frequency (%) |
| C | 1976804 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6918814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1976804 | |
| 0 | 1976804 | |
| _ | 1976804 | |
| 1 | 988402 |
modified
Text
| Distinct | 103380 |
|---|---|
| Distinct (%) | 10.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 20 |
| Mean length | 20 |
| Min length | 20 |
Unique
| Unique | 39084 ? |
|---|---|
| Unique (%) | 4.0% |
Sample
| 1st row | 2016-08-30T13:42:00Z |
|---|---|
| 2nd row | 2022-10-26T17:57:00Z |
| 3rd row | 2020-05-10T23:06:00Z |
| 4th row | 2020-04-09T11:53:00Z |
| 5th row | 2021-09-10T21:16:00Z |
| Value | Count | Frequency (%) |
| 2024-10-17t09:48:00z | 1536 | 0.2% |
| 2024-10-17t09:52:00z | 1531 | 0.2% |
| 2024-10-17t09:51:00z | 1451 | 0.1% |
| 2024-10-17t09:55:00z | 1419 | 0.1% |
| 2024-10-17t09:49:00z | 1377 | 0.1% |
| 2024-10-17t09:50:00z | 1314 | 0.1% |
| 2024-10-17t09:53:00z | 1255 | 0.1% |
| 2024-10-17t09:54:00z | 1248 | 0.1% |
| 2024-10-17t09:57:00z | 1194 | 0.1% |
| 2024-10-17t09:56:00z | 1136 | 0.1% |
| Other values (103370) | 974941 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 5048546 | |
| 2 | 2613229 | |
| 1 | 2474741 | |
| - | 1976804 | 10.0% |
| : | 1976804 | 10.0% |
| T | 988402 | 5.0% |
| Z | 988402 | 5.0% |
| 3 | 648649 | 3.3% |
| 8 | 570563 | 2.9% |
| 9 | 562325 | 2.8% |
| Other values (4) | 1919575 | 9.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 13837628 | |
| Dash Punctuation | 1976804 | 10.0% |
| Other Punctuation | 1976804 | 10.0% |
| Uppercase Letter | 1976804 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 5048546 | |
| 2 | 2613229 | |
| 1 | 2474741 | |
| 3 | 648649 | 4.7% |
| 8 | 570563 | 4.1% |
| 9 | 562325 | 4.1% |
| 4 | 522023 | 3.8% |
| 7 | 507317 | 3.7% |
| 5 | 461935 | 3.3% |
| 6 | 428300 | 3.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 988402 | |
| Z | 988402 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1976804 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1976804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 17791236 | |
| Latin | 1976804 | 10.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 5048546 | |
| 2 | 2613229 | |
| 1 | 2474741 | |
| - | 1976804 | 11.1% |
| : | 1976804 | 11.1% |
| 3 | 648649 | 3.6% |
| 8 | 570563 | 3.2% |
| 9 | 562325 | 3.2% |
| 4 | 522023 | 2.9% |
| 7 | 507317 | 2.9% |
| Other values (2) | 890235 | 5.0% |
Latin
| Value | Count | Frequency (%) |
| T | 988402 | |
| Z | 988402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19768040 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 5048546 | |
| 2 | 2613229 | |
| 1 | 2474741 | |
| - | 1976804 | 10.0% |
| : | 1976804 | 10.0% |
| T | 988402 | 5.0% |
| Z | 988402 | 5.0% |
| 3 | 648649 | 3.3% |
| 8 | 570563 | 2.9% |
| 9 | 562325 | 2.8% |
| Other values (4) | 1919575 | 9.7% |
publisher
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | National Museum of Natural History, Smithsonian Institution |
|---|---|
| 2nd row | National Museum of Natural History, Smithsonian Institution |
| 3rd row | National Museum of Natural History, Smithsonian Institution |
| 4th row | National Museum of Natural History, Smithsonian Institution |
| 5th row | National Museum of Natural History, Smithsonian Institution |
| Value | Count | Frequency (%) |
| national | 988402 | |
| museum | 988402 | |
| of | 988402 | |
| natural | 988402 | |
| history | 988402 | |
| smithsonian | 988402 | |
| institution | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 6918814 | |
| i | 5930412 | |
| 5930412 | ||
| a | 4942010 | 8.5% |
| o | 4942010 | 8.5% |
| n | 4942010 | 8.5% |
| s | 3953608 | 6.8% |
| u | 3953608 | 6.8% |
| r | 1976804 | 3.4% |
| m | 1976804 | 3.4% |
| Other values (11) | 12849226 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 45466492 | |
| Space Separator | 5930412 | 10.2% |
| Uppercase Letter | 5930412 | 10.2% |
| Other Punctuation | 988402 | 1.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 6918814 | |
| i | 5930412 | |
| a | 4942010 | |
| o | 4942010 | |
| n | 4942010 | |
| s | 3953608 | |
| u | 3953608 | |
| r | 1976804 | 4.3% |
| m | 1976804 | 4.3% |
| l | 1976804 | 4.3% |
| Other values (4) | 3953608 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1976804 | |
| M | 988402 | |
| H | 988402 | |
| S | 988402 | |
| I | 988402 |
Space Separator
| Value | Count | Frequency (%) |
| 5930412 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 988402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51396904 | |
| Common | 6918814 | 11.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 6918814 | |
| i | 5930412 | |
| a | 4942010 | |
| o | 4942010 | |
| n | 4942010 | |
| s | 3953608 | 7.7% |
| u | 3953608 | 7.7% |
| r | 1976804 | 3.8% |
| m | 1976804 | 3.8% |
| N | 1976804 | 3.8% |
| Other values (9) | 9884020 |
Common
| Value | Count | Frequency (%) |
| 5930412 | ||
| , | 988402 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 58315718 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 6918814 | |
| i | 5930412 | |
| 5930412 | ||
| a | 4942010 | 8.5% |
| o | 4942010 | 8.5% |
| n | 4942010 | 8.5% |
| s | 3953608 | 6.8% |
| u | 3953608 | 6.8% |
| r | 1976804 | 3.4% |
| m | 1976804 | 3.4% |
| Other values (11) | 12849226 |
institutionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 29 |
| Mean length | 29 |
| Min length | 29 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:lsid:biocol.org:col:15463 |
|---|---|
| 2nd row | urn:lsid:biocol.org:col:15463 |
| 3rd row | urn:lsid:biocol.org:col:15463 |
| 4th row | urn:lsid:biocol.org:col:15463 |
| 5th row | urn:lsid:biocol.org:col:15463 |
| Value | Count | Frequency (%) |
| urn:lsid:biocol.org:col:15463 | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 3953608 | |
| : | 3953608 | |
| l | 2965206 | 10.3% |
| i | 1976804 | 6.9% |
| r | 1976804 | 6.9% |
| c | 1976804 | 6.9% |
| g | 988402 | 3.4% |
| 6 | 988402 | 3.4% |
| 4 | 988402 | 3.4% |
| 5 | 988402 | 3.4% |
| Other values (8) | 7907216 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18779638 | |
| Other Punctuation | 4942010 | 17.2% |
| Decimal Number | 4942010 | 17.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 3953608 | |
| l | 2965206 | |
| i | 1976804 | |
| r | 1976804 | |
| c | 1976804 | |
| g | 988402 | 5.3% |
| u | 988402 | 5.3% |
| b | 988402 | 5.3% |
| d | 988402 | 5.3% |
| s | 988402 | 5.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 988402 | |
| 4 | 988402 | |
| 5 | 988402 | |
| 1 | 988402 | |
| 3 | 988402 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 3953608 | |
| . | 988402 | 20.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18779638 | |
| Common | 9884020 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 3953608 | |
| l | 2965206 | |
| i | 1976804 | |
| r | 1976804 | |
| c | 1976804 | |
| g | 988402 | 5.3% |
| u | 988402 | 5.3% |
| b | 988402 | 5.3% |
| d | 988402 | 5.3% |
| s | 988402 | 5.3% |
Common
| Value | Count | Frequency (%) |
| : | 3953608 | |
| 6 | 988402 | 10.0% |
| 4 | 988402 | 10.0% |
| 5 | 988402 | 10.0% |
| 1 | 988402 | 10.0% |
| . | 988402 | 10.0% |
| 3 | 988402 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28663658 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 3953608 | |
| : | 3953608 | |
| l | 2965206 | 10.3% |
| i | 1976804 | 6.9% |
| r | 1976804 | 6.9% |
| c | 1976804 | 6.9% |
| g | 988402 | 3.4% |
| 6 | 988402 | 3.4% |
| 4 | 988402 | 3.4% |
| 5 | 988402 | 3.4% |
| Other values (8) | 7907216 |
collectionID
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 45 |
|---|---|
| Median length | 45 |
| Mean length | 45 |
| Min length | 45 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
|---|---|
| 2nd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 3rd row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 4th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| 5th row | urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 |
| Value | Count | Frequency (%) |
| urn:uuid:60e28f81-e634-4869-aa3e-732caed713c8 | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| 8 | 3953608 | 8.9% |
| 3 | 3953608 | 8.9% |
| - | 3953608 | 8.9% |
| e | 3953608 | 8.9% |
| 6 | 2965206 | 6.7% |
| a | 2965206 | 6.7% |
| u | 2965206 | 6.7% |
| d | 1976804 | 4.4% |
| 2 | 1976804 | 4.4% |
| 1 | 1976804 | 4.4% |
| Other values (10) | 13837628 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 20756442 | |
| Lowercase Letter | 17791236 | |
| Dash Punctuation | 3953608 | 8.9% |
| Other Punctuation | 1976804 | 4.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3953608 | |
| 3 | 3953608 | |
| 6 | 2965206 | |
| 2 | 1976804 | |
| 1 | 1976804 | |
| 4 | 1976804 | |
| 7 | 1976804 | |
| 0 | 988402 | 4.8% |
| 9 | 988402 | 4.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 3953608 | |
| a | 2965206 | |
| u | 2965206 | |
| d | 1976804 | |
| c | 1976804 | |
| r | 988402 | 5.6% |
| f | 988402 | 5.6% |
| i | 988402 | 5.6% |
| n | 988402 | 5.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3953608 |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1976804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 26686854 | |
| Latin | 17791236 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 8 | 3953608 | |
| 3 | 3953608 | |
| - | 3953608 | |
| 6 | 2965206 | |
| 2 | 1976804 | |
| 1 | 1976804 | |
| : | 1976804 | |
| 4 | 1976804 | |
| 7 | 1976804 | |
| 0 | 988402 | 3.7% |
Latin
| Value | Count | Frequency (%) |
| e | 3953608 | |
| a | 2965206 | |
| u | 2965206 | |
| d | 1976804 | |
| c | 1976804 | |
| r | 988402 | 5.6% |
| f | 988402 | 5.6% |
| i | 988402 | 5.6% |
| n | 988402 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 44478090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 8 | 3953608 | 8.9% |
| 3 | 3953608 | 8.9% |
| - | 3953608 | 8.9% |
| e | 3953608 | 8.9% |
| 6 | 2965206 | 6.7% |
| a | 2965206 | 6.7% |
| u | 2965206 | 6.7% |
| d | 1976804 | 4.4% |
| 2 | 1976804 | 4.4% |
| 1 | 1976804 | 4.4% |
| Other values (10) | 13837628 |
institutionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1976804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1976804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1976804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
collectionCode
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1976804 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1976804 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1976804 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 988402 | |
| S | 988402 |
datasetName
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 19 |
| Mean length | 19 |
| Min length | 19 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NMNH Extant Biology |
|---|---|
| 2nd row | NMNH Extant Biology |
| 3rd row | NMNH Extant Biology |
| 4th row | NMNH Extant Biology |
| 5th row | NMNH Extant Biology |
| Value | Count | Frequency (%) |
| nmnh | 988402 | |
| extant | 988402 | |
| biology | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 1976804 | 10.5% |
| 1976804 | 10.5% | |
| t | 1976804 | 10.5% |
| o | 1976804 | 10.5% |
| M | 988402 | 5.3% |
| H | 988402 | 5.3% |
| E | 988402 | 5.3% |
| x | 988402 | 5.3% |
| a | 988402 | 5.3% |
| n | 988402 | 5.3% |
| Other values (5) | 4942010 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10872422 | |
| Uppercase Letter | 5930412 | |
| Space Separator | 1976804 | 10.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 1976804 | |
| o | 1976804 | |
| x | 988402 | |
| a | 988402 | |
| n | 988402 | |
| i | 988402 | |
| l | 988402 | |
| g | 988402 | |
| y | 988402 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1976804 | |
| M | 988402 | |
| H | 988402 | |
| E | 988402 | |
| B | 988402 |
Space Separator
| Value | Count | Frequency (%) |
| 1976804 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16802834 | |
| Common | 1976804 | 10.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 1976804 | |
| t | 1976804 | |
| o | 1976804 | |
| M | 988402 | 5.9% |
| H | 988402 | 5.9% |
| E | 988402 | 5.9% |
| x | 988402 | 5.9% |
| a | 988402 | 5.9% |
| n | 988402 | 5.9% |
| B | 988402 | 5.9% |
| Other values (4) | 3953608 |
Common
| Value | Count | Frequency (%) |
| 1976804 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18779638 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 1976804 | 10.5% |
| 1976804 | 10.5% | |
| t | 1976804 | 10.5% |
| o | 1976804 | 10.5% |
| M | 988402 | 5.3% |
| H | 988402 | 5.3% |
| E | 988402 | 5.3% |
| x | 988402 | 5.3% |
| a | 988402 | 5.3% |
| n | 988402 | 5.3% |
| Other values (5) | 4942010 |
basisOfRecord
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 18 |
| Mean length | 18.01104712 |
| Min length | 18 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESERVED_SPECIMEN |
|---|---|
| 2nd row | PRESERVED_SPECIMEN |
| 3rd row | PRESERVED_SPECIMEN |
| 4th row | PRESERVED_SPECIMEN |
| 5th row | PRESERVED_SPECIMEN |
| Value | Count | Frequency (%) |
| preserved_specimen | 977483 | |
| machine_observation | 10919 | 1.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 4909253 | |
| R | 1965885 | |
| S | 1965885 | |
| P | 1954966 | 11.0% |
| I | 999321 | 5.6% |
| N | 999321 | 5.6% |
| V | 988402 | 5.6% |
| _ | 988402 | 5.6% |
| C | 988402 | 5.6% |
| M | 988402 | 5.6% |
| Other values (6) | 1053916 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16813753 | |
| Connector Punctuation | 988402 | 5.6% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 4909253 | |
| R | 1965885 | |
| S | 1965885 | |
| P | 1954966 | 11.6% |
| I | 999321 | 5.9% |
| N | 999321 | 5.9% |
| V | 988402 | 5.9% |
| C | 988402 | 5.9% |
| M | 988402 | 5.9% |
| D | 977483 | 5.8% |
| Other values (5) | 76433 | 0.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 988402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16813753 | |
| Common | 988402 | 5.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 4909253 | |
| R | 1965885 | |
| S | 1965885 | |
| P | 1954966 | 11.6% |
| I | 999321 | 5.9% |
| N | 999321 | 5.9% |
| V | 988402 | 5.9% |
| C | 988402 | 5.9% |
| M | 988402 | 5.9% |
| D | 977483 | 5.8% |
| Other values (5) | 76433 | 0.5% |
Common
| Value | Count | Frequency (%) |
| _ | 988402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17802155 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 4909253 | |
| R | 1965885 | |
| S | 1965885 | |
| P | 1954966 | 11.0% |
| I | 999321 | 5.6% |
| N | 999321 | 5.6% |
| V | 988402 | 5.6% |
| _ | 988402 | 5.6% |
| C | 988402 | 5.6% |
| M | 988402 | 5.6% |
| Other values (6) | 1053916 | 5.9% |
occurrenceID
Text
Unique 
| Distinct | 988402 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 63 |
|---|---|
| Median length | 63 |
| Mean length | 63 |
| Min length | 63 |
Unique
| Unique | 988402 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | http://n2t.net/ark:/65665/383aab1ce-8b35-4007-8eba-472b592b7a99 |
|---|---|
| 2nd row | http://n2t.net/ark:/65665/3c8351e79-8b3b-4df0-80be-cb019ba60185 |
| 3rd row | http://n2t.net/ark:/65665/3c8377593-a51b-4b6a-835d-649053b2ef0f |
| 4th row | http://n2t.net/ark:/65665/383b388e9-b7cc-4b41-95cc-e0a1b092179a |
| 5th row | http://n2t.net/ark:/65665/3c83e5abc-b64e-45a4-aa42-faf5abc93792 |
| Value | Count | Frequency (%) |
| http://n2t.net/ark:/65665/383aab1ce-8b35-4007-8eba-472b592b7a99 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/384f1e549-df99-4ba7-87ad-271f1281c0f1 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383de17db-7e58-4b17-9277-c255affc4cdb | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8910e37-f290-4195-961c-13f0efedc290 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383c2daca-1de6-4e79-9d0c-f3b5838bafb2 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c8377593-a51b-4b6a-835d-649053b2ef0f | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383b388e9-b7cc-4b41-95cc-e0a1b092179a | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c83e5abc-b64e-45a4-aa42-faf5abc93792 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/3c83f60ef-2f0d-451e-986a-e0c2dfb03675 | 1 | < 0.1% |
| http://n2t.net/ark:/65665/383b6d73e-eb70-4b52-81b8-336878ca92f0 | 1 | < 0.1% |
| Other values (988392) | 988392 |
Most occurring characters
| Value | Count | Frequency (%) |
| / | 4942010 | 7.9% |
| 6 | 4818872 | 7.7% |
| - | 3953608 | 6.3% |
| t | 3953608 | 6.3% |
| 5 | 3831724 | 6.2% |
| a | 3087982 | 5.0% |
| 4 | 2844631 | 4.6% |
| e | 2842218 | 4.6% |
| 2 | 2841949 | 4.6% |
| 3 | 2841551 | 4.6% |
| Other values (16) | 26311173 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 26932854 | |
| Lowercase Letter | 23475648 | |
| Other Punctuation | 7907216 | 12.7% |
| Dash Punctuation | 3953608 | 6.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 3953608 | |
| a | 3087982 | |
| e | 2842218 | |
| b | 2099835 | |
| n | 1976804 | |
| f | 1854245 | |
| d | 1853885 | |
| c | 1853463 | |
| k | 988402 | 4.2% |
| r | 988402 | 4.2% |
| Other values (2) | 1976804 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 4818872 | |
| 5 | 3831724 | |
| 4 | 2844631 | |
| 2 | 2841949 | |
| 3 | 2841551 | |
| 9 | 2101095 | |
| 8 | 2097885 | |
| 7 | 1853396 | 6.9% |
| 0 | 1851151 | 6.9% |
| 1 | 1850600 | 6.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 4942010 | |
| : | 1976804 | 25.0% |
| . | 988402 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3953608 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 38793678 | |
| Latin | 23475648 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| / | 4942010 | |
| 6 | 4818872 | |
| - | 3953608 | |
| 5 | 3831724 | |
| 4 | 2844631 | |
| 2 | 2841949 | |
| 3 | 2841551 | |
| 9 | 2101095 | 5.4% |
| 8 | 2097885 | 5.4% |
| : | 1976804 | 5.1% |
| Other values (4) | 6543549 |
Latin
| Value | Count | Frequency (%) |
| t | 3953608 | |
| a | 3087982 | |
| e | 2842218 | |
| b | 2099835 | |
| n | 1976804 | |
| f | 1854245 | |
| d | 1853885 | |
| c | 1853463 | |
| k | 988402 | 4.2% |
| r | 988402 | 4.2% |
| Other values (2) | 1976804 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 62269326 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| / | 4942010 | 7.9% |
| 6 | 4818872 | 7.7% |
| - | 3953608 | 6.3% |
| t | 3953608 | 6.3% |
| 5 | 3831724 | 6.2% |
| a | 3087982 | 5.0% |
| 4 | 2844631 | 4.6% |
| e | 2842218 | 4.6% |
| 2 | 2841949 | 4.6% |
| 3 | 2841551 | 4.6% |
| Other values (16) | 26311173 |
catalogNumber
Text
Missing 
| Distinct | 843685 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 132504 |
| Missing (%) | 13.4% |
| Memory size | 7.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 10 |
| Mean length | 9.636065279 |
| Min length | 4 |
Unique
| Unique | 832040 ? |
|---|---|
| Unique (%) | 97.2% |
Sample
| 1st row | US 213621 |
|---|---|
| 2nd row | US 2144946 |
| 3rd row | US 3113222 |
| 4th row | US 2583825 |
| 5th row | US 3026466 |
| Value | Count | Frequency (%) |
| us | 846588 | |
| sem | 52 | < 0.1% |
| 1 | 35 | < 0.1% |
| 27 | < 0.1% | |
| micrograph | 26 | < 0.1% |
| stub | 26 | < 0.1% |
| 3 | 15 | < 0.1% |
| 2 | 13 | < 0.1% |
| 169920 | 12 | < 0.1% |
| 95489 | 9 | < 0.1% |
| Other values (843649) | 855865 |
Most occurring characters
| Value | Count | Frequency (%) |
| S | 855977 | |
| U | 855899 | |
| 846770 | ||
| 2 | 752400 | |
| 1 | 735998 | |
| 3 | 670736 | |
| 5 | 512863 | 6.2% |
| 4 | 511349 | 6.2% |
| 6 | 510981 | 6.2% |
| 7 | 501756 | 6.1% |
| Other values (36) | 1492760 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 5660089 | |
| Uppercase Letter | 1725893 | 20.9% |
| Space Separator | 846770 | 10.3% |
| Lowercase Letter | 9731 | 0.1% |
| Dash Punctuation | 4981 | 0.1% |
| Close Punctuation | 10 | < 0.1% |
| Open Punctuation | 10 | < 0.1% |
| Other Punctuation | 5 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| w | 9311 | |
| r | 53 | 0.5% |
| a | 42 | 0.4% |
| u | 36 | 0.4% |
| p | 30 | 0.3% |
| b | 29 | 0.3% |
| i | 28 | 0.3% |
| o | 27 | 0.3% |
| m | 27 | 0.3% |
| c | 27 | 0.3% |
| Other values (10) | 121 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 855977 | |
| U | 855899 | |
| D | 8488 | 0.5% |
| A | 5354 | 0.3% |
| E | 73 | < 0.1% |
| M | 52 | < 0.1% |
| P | 21 | < 0.1% |
| B | 18 | < 0.1% |
| L | 10 | < 0.1% |
| V | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 752400 | |
| 1 | 735998 | |
| 3 | 670736 | |
| 5 | 512863 | |
| 4 | 511349 | |
| 6 | 510981 | |
| 7 | 501756 | |
| 0 | 492646 | |
| 9 | 485776 | |
| 8 | 485584 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 | |
| ? | 1 | 20.0% |
Space Separator
| Value | Count | Frequency (%) |
| 846770 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4981 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 10 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 10 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6511865 | |
| Latin | 1735624 | 21.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| S | 855977 | |
| U | 855899 | |
| w | 9311 | 0.5% |
| D | 8488 | 0.5% |
| A | 5354 | 0.3% |
| E | 73 | < 0.1% |
| r | 53 | < 0.1% |
| M | 52 | < 0.1% |
| a | 42 | < 0.1% |
| u | 36 | < 0.1% |
| Other values (20) | 339 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 846770 | ||
| 2 | 752400 | |
| 1 | 735998 | |
| 3 | 670736 | |
| 5 | 512863 | |
| 4 | 511349 | |
| 6 | 510981 | |
| 7 | 501756 | |
| 0 | 492646 | |
| 9 | 485776 | |
| Other values (6) | 490590 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8247489 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| S | 855977 | |
| U | 855899 | |
| 846770 | ||
| 2 | 752400 | |
| 1 | 735998 | |
| 3 | 670736 | |
| 5 | 512863 | 6.2% |
| 4 | 511349 | 6.2% |
| 6 | 510981 | 6.2% |
| 7 | 501756 | 6.1% |
| Other values (36) | 1492760 |
recordNumber
Text
| Distinct | 163293 |
|---|---|
| Distinct (%) | 16.7% |
| Missing | 8698 |
| Missing (%) | 0.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 90 |
|---|---|
| Median length | 72 |
| Mean length | 4.48941415 |
| Min length | 1 |
Unique
| Unique | 114501 ? |
|---|---|
| Unique (%) | 11.7% |
Sample
| 1st row | BLM-210-IV-11-B-TDS |
|---|---|
| 2nd row | 4319 |
| 3rd row | 2429 |
| 4th row | 95426 |
| 5th row | 1414/512 |
| Value | Count | Frequency (%) |
| s.n | 141397 | 13.6% |
| bureau | 4447 | 0.4% |
| eyd | 3365 | 0.3% |
| s | 3110 | 0.3% |
| n | 3006 | 0.3% |
| of | 2991 | 0.3% |
| science | 2898 | 0.3% |
| d&ml | 2806 | 0.3% |
| 2716 | 0.3% | |
| h | 1941 | 0.2% |
| Other values (128797) | 872266 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 527431 | |
| 2 | 407354 | |
| 3 | 350940 | 8.0% |
| 4 | 329711 | 7.5% |
| 5 | 316988 | 7.2% |
| 0 | 316877 | 7.2% |
| 6 | 306238 | 7.0% |
| . | 298537 | 6.8% |
| 7 | 287805 | 6.5% |
| 8 | 276699 | 6.3% |
| Other values (99) | 979717 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3389386 | |
| Lowercase Letter | 401596 | 9.1% |
| Other Punctuation | 316346 | 7.2% |
| Uppercase Letter | 158999 | 3.6% |
| Dash Punctuation | 66578 | 1.5% |
| Space Separator | 61239 | 1.4% |
| Open Punctuation | 1749 | < 0.1% |
| Close Punctuation | 1740 | < 0.1% |
| Other Number | 383 | < 0.1% |
| Connector Punctuation | 142 | < 0.1% |
| Other values (4) | 139 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 151013 | |
| s | 148056 | |
| a | 16112 | 4.0% |
| e | 15277 | 3.8% |
| u | 10511 | 2.6% |
| r | 10123 | 2.5% |
| c | 8774 | 2.2% |
| o | 8180 | 2.0% |
| i | 7692 | 1.9% |
| t | 6218 | 1.5% |
| Other values (26) | 19640 | 4.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 15372 | 9.7% |
| B | 15337 | 9.6% |
| S | 14472 | 9.1% |
| D | 11952 | 7.5% |
| H | 10454 | 6.6% |
| L | 9038 | 5.7% |
| M | 8771 | 5.5% |
| E | 8695 | 5.5% |
| I | 7475 | 4.7% |
| N | 6748 | 4.2% |
| Other values (18) | 50685 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 298537 | |
| & | 5375 | 1.7% |
| / | 5177 | 1.6% |
| * | 3127 | 1.0% |
| ? | 2292 | 0.7% |
| , | 1072 | 0.3% |
| ! | 476 | 0.2% |
| # | 84 | < 0.1% |
| ; | 74 | < 0.1% |
| : | 68 | < 0.1% |
| Other values (5) | 64 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 527431 | |
| 2 | 407354 | |
| 3 | 350940 | |
| 4 | 329711 | |
| 5 | 316988 | |
| 0 | 316877 | |
| 6 | 306238 | |
| 7 | 287805 | |
| 8 | 276699 | |
| 9 | 269343 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 368 | |
| ² | 6 | 1.6% |
| ¼ | 5 | 1.3% |
| ¾ | 3 | 0.8% |
| ⅓ | 1 | 0.3% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1585 | |
| [ | 105 | 6.0% |
| { | 59 | 3.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1577 | |
| ] | 104 | 6.0% |
| } | 59 | 3.4% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 112 | |
| + | 23 | 16.9% |
| ~ | 1 | 0.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 66578 |
Space Separator
| Value | Count | Frequency (%) |
| 61239 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 142 |
Modifier Letter
| Value | Count | Frequency (%) |
| ˍ | 1 |
Other Letter
| Value | Count | Frequency (%) |
| ª | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| › | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3837701 | |
| Latin | 560595 | 12.7% |
| Greek | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| n | 151013 | |
| s | 148056 | |
| a | 16112 | 2.9% |
| A | 15372 | 2.7% |
| B | 15337 | 2.7% |
| e | 15277 | 2.7% |
| S | 14472 | 2.6% |
| D | 11952 | 2.1% |
| u | 10511 | 1.9% |
| H | 10454 | 1.9% |
| Other values (54) | 152039 |
Common
| Value | Count | Frequency (%) |
| 1 | 527431 | |
| 2 | 407354 | |
| 3 | 350940 | |
| 4 | 329711 | |
| 5 | 316988 | |
| 0 | 316877 | |
| 6 | 306238 | |
| . | 298537 | |
| 7 | 287805 | |
| 8 | 276699 | |
| Other values (34) | 419121 |
Greek
| Value | Count | Frequency (%) |
| Σ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4397881 | |
| None | 412 | < 0.1% |
| Punctuation | 2 | < 0.1% |
| Modifier Letters | 1 | < 0.1% |
| Number Forms | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 527431 | |
| 2 | 407354 | |
| 3 | 350940 | 8.0% |
| 4 | 329711 | 7.5% |
| 5 | 316988 | 7.2% |
| 0 | 316877 | 7.2% |
| 6 | 306238 | 7.0% |
| . | 298537 | 6.8% |
| 7 | 287805 | 6.5% |
| 8 | 276699 | 6.3% |
| Other values (78) | 979301 |
None
| Value | Count | Frequency (%) |
| ½ | 368 | |
| è | 11 | 2.7% |
| ² | 6 | 1.5% |
| ¼ | 5 | 1.2% |
| ü | 4 | 1.0% |
| ¾ | 3 | 0.7% |
| é | 3 | 0.7% |
| ú | 2 | 0.5% |
| ó | 2 | 0.5% |
| á | 1 | 0.2% |
| Other values (7) | 7 | 1.7% |
Modifier Letters
| Value | Count | Frequency (%) |
| ˍ | 1 |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 1 |
Punctuation
| Value | Count | Frequency (%) |
| … | 1 | |
| › | 1 |
recordedBy
Text
Missing 
| Distinct | 71729 |
|---|---|
| Distinct (%) | 7.3% |
| Missing | 11879 |
| Missing (%) | 1.2% |
| Memory size | 7.5 MiB |
Length
| Max length | 201 |
|---|---|
| Median length | 155 |
| Mean length | 17.24259848 |
| Min length | 1 |
Unique
| Unique | 36179 ? |
|---|---|
| Unique (%) | 3.7% |
Sample
| 1st row | Continental Shelf Associates for the MMS/BLM |
|---|---|
| 2nd row | J. Soukup |
| 3rd row | I. Morel |
| 4th row | J. Steyermark & Cora Steyermark |
| 5th row | A. Oakes & -. Ellis |
| Value | Count | Frequency (%) |
| 273336 | 7.3% | |
| j | 195095 | 5.2% |
| a | 167294 | 4.5% |
| r | 148560 | 4.0% |
| e | 148212 | 4.0% |
| c | 138644 | 3.7% |
| m | 133736 | 3.6% |
| h | 120329 | 3.2% |
| l | 97924 | 2.6% |
| w | 96924 | 2.6% |
| Other values (28460) | 2203562 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2747093 | ||
| . | 2002049 | 11.9% |
| e | 1082546 | 6.4% |
| r | 792877 | 4.7% |
| a | 787773 | 4.7% |
| o | 664253 | 3.9% |
| n | 660896 | 3.9% |
| l | 631169 | 3.7% |
| i | 544852 | 3.2% |
| t | 439669 | 2.6% |
| Other values (130) | 6484617 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8170236 | |
| Uppercase Letter | 3425777 | |
| Space Separator | 2747093 | 16.3% |
| Other Punctuation | 2413800 | 14.3% |
| Dash Punctuation | 73409 | 0.4% |
| Decimal Number | 2960 | < 0.1% |
| Close Punctuation | 2241 | < 0.1% |
| Open Punctuation | 2241 | < 0.1% |
| Math Symbol | 21 | < 0.1% |
| Modifier Symbol | 8 | < 0.1% |
| Other values (3) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1082546 | |
| r | 792877 | |
| a | 787773 | |
| o | 664253 | 8.1% |
| n | 660896 | 8.1% |
| l | 631169 | 7.7% |
| i | 544852 | 6.7% |
| t | 439669 | 5.4% |
| s | 416260 | 5.1% |
| u | 253955 | 3.1% |
| Other values (60) | 1895986 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 262797 | 7.7% |
| S | 258602 | 7.5% |
| M | 242234 | 7.1% |
| R | 242073 | 7.1% |
| H | 238630 | 7.0% |
| A | 234211 | 6.8% |
| J | 232414 | 6.8% |
| E | 190208 | 5.6% |
| B | 181150 | 5.3% |
| L | 176618 | 5.2% |
| Other values (29) | 1166840 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2002049 | |
| & | 241593 | 10.0% |
| , | 166145 | 6.9% |
| ' | 2858 | 0.1% |
| / | 830 | < 0.1% |
| " | 314 | < 0.1% |
| ? | 6 | < 0.1% |
| ; | 3 | < 0.1% |
| : | 1 | < 0.1% |
| ¡ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 560 | |
| 1 | 529 | |
| 9 | 528 | |
| 0 | 390 | |
| 4 | 297 | |
| 3 | 275 | |
| 5 | 246 | |
| 2 | 89 | 3.0% |
| 7 | 45 | 1.5% |
| 6 | 1 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2213 | |
| ] | 28 | 1.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2213 | |
| [ | 28 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 2747093 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 73409 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 21 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 8 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 4 |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 2 |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11596013 | |
| Common | 5241781 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1082546 | 9.3% |
| r | 792877 | 6.8% |
| a | 787773 | 6.8% |
| o | 664253 | 5.7% |
| n | 660896 | 5.7% |
| l | 631169 | 5.4% |
| i | 544852 | 4.7% |
| t | 439669 | 3.8% |
| s | 416260 | 3.6% |
| C | 262797 | 2.3% |
| Other values (99) | 5312921 |
Common
| Value | Count | Frequency (%) |
| 2747093 | ||
| . | 2002049 | |
| & | 241593 | 4.6% |
| , | 166145 | 3.2% |
| - | 73409 | 1.4% |
| ' | 2858 | 0.1% |
| ) | 2213 | < 0.1% |
| ( | 2213 | < 0.1% |
| / | 830 | < 0.1% |
| 8 | 560 | < 0.1% |
| Other values (21) | 2818 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 16783092 | |
| None | 54701 | 0.3% |
| IPA Ext | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2747093 | ||
| . | 2002049 | 11.9% |
| e | 1082546 | 6.5% |
| r | 792877 | 4.7% |
| a | 787773 | 4.7% |
| o | 664253 | 4.0% |
| n | 660896 | 3.9% |
| l | 631169 | 3.8% |
| i | 544852 | 3.2% |
| t | 439669 | 2.6% |
| Other values (68) | 6429915 |
None
| Value | Count | Frequency (%) |
| é | 9407 | |
| á | 9291 | |
| ó | 8471 | |
| í | 6255 | |
| ñ | 5412 | |
| è | 3789 | |
| ü | 3008 | 5.5% |
| ö | 2276 | 4.2% |
| ê | 1495 | 2.7% |
| ä | 701 | 1.3% |
| Other values (51) | 4596 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 1 |
individualCount
Text
| Distinct | 18 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 117 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.00001113 |
| Min length | 1 |
Unique
| Unique | 6 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
| Value | Count | Frequency (%) |
| 1 | 988001 | |
| 2 | 114 | < 0.1% |
| 0 | 64 | < 0.1% |
| 3 | 35 | < 0.1% |
| 4 | 26 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 8 | < 0.1% |
| 9 | 5 | < 0.1% |
| 7 | 5 | < 0.1% |
| 11 | 3 | < 0.1% |
| Other values (8) | 10 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 988014 | |
| 2 | 117 | < 0.1% |
| 0 | 66 | < 0.1% |
| 3 | 35 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 3 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 988296 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 988014 | |
| 2 | 117 | < 0.1% |
| 0 | 66 | < 0.1% |
| 3 | 35 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 3 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 988296 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 988014 | |
| 2 | 117 | < 0.1% |
| 0 | 66 | < 0.1% |
| 3 | 35 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 3 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 988296 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 988014 | |
| 2 | 117 | < 0.1% |
| 0 | 66 | < 0.1% |
| 3 | 35 | < 0.1% |
| 4 | 27 | < 0.1% |
| 5 | 14 | < 0.1% |
| 6 | 9 | < 0.1% |
| 9 | 6 | < 0.1% |
| 7 | 5 | < 0.1% |
| 8 | 3 | < 0.1% |
lifeStage
Text
Missing 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 916836 |
| Missing (%) | 92.8% |
| Memory size | 7.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 8.755330744 |
| Min length | 8 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Fruiting |
|---|---|
| 2nd row | Flowering |
| 3rd row | Flowering |
| 4th row | Flowering |
| 5th row | Flowering |
| Value | Count | Frequency (%) |
| flowering | 43566 | |
| fruiting | 22755 | |
| vegetative | 5245 | 7.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 94321 | |
| g | 71566 | |
| F | 66321 | |
| r | 66321 | |
| n | 66321 | |
| e | 59301 | |
| l | 43566 | |
| o | 43566 | |
| w | 43566 | |
| t | 33245 | 5.3% |
| Other values (4) | 38490 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 555018 | |
| Uppercase Letter | 71566 | 11.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 94321 | |
| g | 71566 | |
| r | 66321 | |
| n | 66321 | |
| e | 59301 | |
| l | 43566 | |
| o | 43566 | |
| w | 43566 | |
| t | 33245 | 6.0% |
| u | 22755 | 4.1% |
| Other values (2) | 10490 | 1.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 66321 | |
| V | 5245 | 7.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 626584 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 94321 | |
| g | 71566 | |
| F | 66321 | |
| r | 66321 | |
| n | 66321 | |
| e | 59301 | |
| l | 43566 | |
| o | 43566 | |
| w | 43566 | |
| t | 33245 | 5.3% |
| Other values (4) | 38490 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 626584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 94321 | |
| g | 71566 | |
| F | 66321 | |
| r | 66321 | |
| n | 66321 | |
| e | 59301 | |
| l | 43566 | |
| o | 43566 | |
| w | 43566 | |
| t | 33245 | 5.3% |
| Other values (4) | 38490 |
occurrenceStatus
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | PRESENT |
|---|---|
| 2nd row | PRESENT |
| 3rd row | PRESENT |
| 4th row | PRESENT |
| 5th row | PRESENT |
| Value | Count | Frequency (%) |
| present | 988402 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1976804 | |
| P | 988402 | |
| R | 988402 | |
| S | 988402 | |
| N | 988402 | |
| T | 988402 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6918814 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1976804 | |
| P | 988402 | |
| R | 988402 | |
| S | 988402 | |
| N | 988402 | |
| T | 988402 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6918814 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1976804 | |
| P | 988402 | |
| R | 988402 | |
| S | 988402 | |
| N | 988402 | |
| T | 988402 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6918814 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1976804 | |
| P | 988402 | |
| R | 988402 | |
| S | 988402 | |
| N | 988402 | |
| T | 988402 |
preparations
Text
Missing 
| Distinct | 77 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 959242 |
| Missing (%) | 97.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 142 |
|---|---|
| Median length | 94 |
| Mean length | 13.18954047 |
| Min length | 3 |
Unique
| Unique | 26 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Wood Sample |
|---|---|
| 2nd row | Photograph |
| 3rd row | Microslide |
| 4th row | Photograph |
| 5th row | Photograph; Photograph |
| Value | Count | Frequency (%) |
| wood | 9236 | |
| sample | 9236 | |
| microslide | 8980 | |
| photograph | 7481 | |
| individual | 4028 | |
| strewn | 2184 | 4.5% |
| sem | 1492 | 3.0% |
| micrograph | 1411 | 2.9% |
| ink | 1139 | 2.3% |
| and | 637 | 1.3% |
| Other values (48) | 3129 | 6.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 44561 | 11.6% |
| i | 33828 | 8.8% |
| d | 27045 | 7.0% |
| a | 23965 | 6.2% |
| l | 23811 | 6.2% |
| r | 23041 | 6.0% |
| e | 21867 | 5.7% |
| 19793 | 5.1% | |
| p | 18828 | 4.9% |
| h | 16434 | 4.3% |
| Other values (33) | 131434 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 310828 | |
| Uppercase Letter | 41028 | 10.7% |
| Space Separator | 19793 | 5.1% |
| Close Punctuation | 6212 | 1.6% |
| Open Punctuation | 6212 | 1.6% |
| Other Punctuation | 534 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 44561 | |
| i | 33828 | |
| d | 27045 | |
| a | 23965 | 7.7% |
| l | 23811 | 7.7% |
| r | 23041 | 7.4% |
| e | 21867 | 7.0% |
| p | 18828 | 6.1% |
| h | 16434 | 5.3% |
| s | 11393 | 3.7% |
| Other values (16) | 66055 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 10846 | |
| M | 10477 | |
| W | 9389 | |
| P | 7465 | |
| E | 1537 | 3.7% |
| B | 589 | 1.4% |
| I | 507 | 1.2% |
| F | 147 | 0.4% |
| D | 52 | 0.1% |
| T | 16 | < 0.1% |
| Other values (2) | 3 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 527 | |
| , | 7 | 1.3% |
Space Separator
| Value | Count | Frequency (%) |
| 19793 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6212 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6212 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 351856 | |
| Common | 32751 | 8.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 44561 | |
| i | 33828 | 9.6% |
| d | 27045 | 7.7% |
| a | 23965 | 6.8% |
| l | 23811 | 6.8% |
| r | 23041 | 6.5% |
| e | 21867 | 6.2% |
| p | 18828 | 5.4% |
| h | 16434 | 4.7% |
| s | 11393 | 3.2% |
| Other values (28) | 107083 |
Common
| Value | Count | Frequency (%) |
| 19793 | ||
| ) | 6212 | 19.0% |
| ( | 6212 | 19.0% |
| ; | 527 | 1.6% |
| , | 7 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 384607 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 44561 | 11.6% |
| i | 33828 | 8.8% |
| d | 27045 | 7.0% |
| a | 23965 | 6.2% |
| l | 23811 | 6.2% |
| r | 23041 | 6.0% |
| e | 21867 | 5.7% |
| 19793 | 5.1% | |
| p | 18828 | 4.9% |
| h | 16434 | 4.3% |
| Other values (33) | 131434 |
Missing 
| Distinct | 73 |
|---|---|
| Distinct (%) | 98.6% |
| Missing | 988328 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 249 |
|---|---|
| Median length | 199 |
| Mean length | 146.972973 |
| Min length | 49 |
Unique
| Unique | 72 ? |
|---|---|
| Unique (%) | 97.3% |
Sample
| 1st row | https://www.ncbi.nlm.nih.gov/gquery?term=ON553270 |
|---|---|
| 2nd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553291 |
| 3rd row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553246 |
| 4th row | https://www.ncbi.nlm.nih.gov/gquery?term=MT553283 |
| 5th row | https://www.ncbi.nlm.nih.gov/gquery?term=EU527211;https://www.ncbi.nlm.nih.gov/gquery?term=EU527308;https://www.ncbi.nlm.nih.gov/gquery?term=EU527261 |
| Value | Count | Frequency (%) |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837181;https://www.ncbi.nlm.nih.gov/gquery?term=jn837465;https://www.ncbi.nlm.nih.gov/gquery?term=jn837361;https://www.ncbi.nlm.nih.gov/gquery?term=jn837271 | 2 | 2.7% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837116;https://www.ncbi.nlm.nih.gov/gquery?term=jn837405;https://www.ncbi.nlm.nih.gov/gquery?term=jn837297;https://www.ncbi.nlm.nih.gov/gquery?term=jn837206 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mt553291 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mt553246 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=mt553283 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu527211;https://www.ncbi.nlm.nih.gov/gquery?term=eu527308;https://www.ncbi.nlm.nih.gov/gquery?term=eu527261 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kf989590;https://www.ncbi.nlm.nih.gov/gquery?term=kf989809;https://www.ncbi.nlm.nih.gov/gquery?term=kf990009;https://www.ncbi.nlm.nih.gov/gquery?term=kf989698 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=jn837113;https://www.ncbi.nlm.nih.gov/gquery?term=jn837294;https://www.ncbi.nlm.nih.gov/gquery?term=jn837203 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=kc986936 | 1 | 1.4% |
| https://www.ncbi.nlm.nih.gov/gquery?term=eu527225;https://www.ncbi.nlm.nih.gov/gquery?term=eu527322;https://www.ncbi.nlm.nih.gov/gquery?term=eu527275 | 1 | 1.4% |
| Other values (63) | 63 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 876 | 8.1% |
| t | 657 | 6.0% |
| / | 657 | 6.0% |
| w | 657 | 6.0% |
| n | 657 | 6.0% |
| h | 438 | 4.0% |
| i | 438 | 4.0% |
| r | 438 | 4.0% |
| e | 438 | 4.0% |
| g | 438 | 4.0% |
| Other values (38) | 5182 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6789 | |
| Other Punctuation | 2116 | 19.5% |
| Decimal Number | 1314 | 12.1% |
| Uppercase Letter | 438 | 4.0% |
| Math Symbol | 219 | 2.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 657 | 9.7% |
| w | 657 | 9.7% |
| n | 657 | 9.7% |
| h | 438 | 6.5% |
| i | 438 | 6.5% |
| r | 438 | 6.5% |
| e | 438 | 6.5% |
| g | 438 | 6.5% |
| m | 438 | 6.5% |
| l | 219 | 3.2% |
| Other values (9) | 1971 |
Uppercase Letter
| Value | Count | Frequency (%) |
| K | 99 | |
| F | 97 | |
| N | 66 | |
| J | 57 | |
| E | 24 | 5.5% |
| U | 24 | 5.5% |
| M | 21 | 4.8% |
| T | 17 | 3.9% |
| Y | 10 | 2.3% |
| A | 9 | 2.1% |
| Other values (3) | 14 | 3.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 271 | |
| 8 | 198 | |
| 3 | 159 | |
| 7 | 159 | |
| 5 | 139 | |
| 2 | 121 | |
| 0 | 75 | 5.7% |
| 1 | 74 | 5.6% |
| 6 | 65 | 4.9% |
| 4 | 53 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 876 | |
| / | 657 | |
| : | 219 | 10.3% |
| ? | 219 | 10.3% |
| ; | 145 | 6.9% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 219 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7227 | |
| Common | 3649 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 657 | 9.1% |
| w | 657 | 9.1% |
| n | 657 | 9.1% |
| h | 438 | 6.1% |
| i | 438 | 6.1% |
| r | 438 | 6.1% |
| e | 438 | 6.1% |
| g | 438 | 6.1% |
| m | 438 | 6.1% |
| l | 219 | 3.0% |
| Other values (22) | 2409 |
Common
| Value | Count | Frequency (%) |
| . | 876 | |
| / | 657 | |
| 9 | 271 | 7.4% |
| : | 219 | 6.0% |
| = | 219 | 6.0% |
| ? | 219 | 6.0% |
| 8 | 198 | 5.4% |
| 3 | 159 | 4.4% |
| 7 | 159 | 4.4% |
| ; | 145 | 4.0% |
| Other values (6) | 527 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10876 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 876 | 8.1% |
| t | 657 | 6.0% |
| / | 657 | 6.0% |
| w | 657 | 6.0% |
| n | 657 | 6.0% |
| h | 438 | 4.0% |
| i | 438 | 4.0% |
| r | 438 | 4.0% |
| e | 438 | 4.0% |
| g | 438 | 4.0% |
| Other values (38) | 5182 |
Missing 
| Distinct | 7579 |
|---|---|
| Distinct (%) | 37.9% |
| Missing | 968411 |
| Missing (%) | 98.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 4263 |
|---|---|
| Median length | 2214 |
| Mean length | 78.05767595 |
| Min length | 1 |
Unique
| Unique | 6837 ? |
|---|---|
| Unique (%) | 34.2% |
Sample
| 1st row | Received as: seed |
|---|---|
| 2nd row | Transcribed by digital volunteers |
| 3rd row | BRG |
| 4th row | Transcribed by digital volunteers; Original spelling as annotated and published is "subplebeia". Same (?) taxon re-published in Contr. U.S. Natl. Herb. 17: 46 (1913) with more explicit type citation. Unclear whether Lecidea subplebeia is a later homonym of Lecidea subplebeja Vain. (1890); Lecidea austrocalifornica Zahlbr. published as replacement name but citing Lecidea "subplebeja Nyl. apud Hasse". The latter name is superfluous if the original name is not a later homonym. |
| 5th row | US, NY |
| Value | Count | Frequency (%) |
| by | 8401 | 3.7% |
| transcribed | 6608 | 2.9% |
| digital | 6534 | 2.9% |
| volunteers | 6533 | 2.9% |
| 4336 | 1.9% | |
| of | 3855 | 1.7% |
| us | 3164 | 1.4% |
| as | 3111 | 1.4% |
| and | 2908 | 1.3% |
| the | 2877 | 1.3% |
| Other values (18932) | 177871 |
Most occurring characters
| Value | Count | Frequency (%) |
| 206207 | 13.2% | |
| e | 125859 | 8.1% |
| a | 97880 | 6.3% |
| i | 90227 | 5.8% |
| t | 77282 | 5.0% |
| n | 75398 | 4.8% |
| o | 74864 | 4.8% |
| r | 73280 | 4.7% |
| l | 65793 | 4.2% |
| s | 59459 | 3.8% |
| Other values (113) | 614202 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1046064 | |
| Space Separator | 206207 | 13.2% |
| Uppercase Letter | 132849 | 8.5% |
| Other Punctuation | 83260 | 5.3% |
| Decimal Number | 69248 | 4.4% |
| Dash Punctuation | 8074 | 0.5% |
| Open Punctuation | 7113 | 0.5% |
| Close Punctuation | 7104 | 0.5% |
| Connector Punctuation | 195 | < 0.1% |
| Math Symbol | 153 | < 0.1% |
| Other values (5) | 184 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 125859 | |
| a | 97880 | 9.4% |
| i | 90227 | 8.6% |
| t | 77282 | 7.4% |
| n | 75398 | 7.2% |
| o | 74864 | 7.2% |
| r | 73280 | 7.0% |
| l | 65793 | 6.3% |
| s | 59459 | 5.7% |
| c | 43851 | 4.2% |
| Other values (32) | 262171 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 13820 | 10.4% |
| T | 12273 | 9.2% |
| C | 11400 | 8.6% |
| A | 10232 | 7.7% |
| B | 8549 | 6.4% |
| P | 6979 | 5.3% |
| F | 6440 | 4.8% |
| R | 6027 | 4.5% |
| H | 5879 | 4.4% |
| M | 5726 | 4.3% |
| Other values (18) | 45524 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 36042 | |
| , | 24281 | |
| ; | 8664 | 10.4% |
| : | 5897 | 7.1% |
| " | 4572 | 5.5% |
| & | 1629 | 2.0% |
| ' | 1020 | 1.2% |
| / | 444 | 0.5% |
| ? | 313 | 0.4% |
| # | 227 | 0.3% |
| Other values (7) | 171 | 0.2% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 15281 | |
| 9 | 9550 | |
| 2 | 7514 | |
| 0 | 7154 | |
| 3 | 5592 | 8.1% |
| 8 | 5486 | 7.9% |
| 4 | 4963 | 7.2% |
| 7 | 4813 | 7.0% |
| 5 | 4717 | 6.8% |
| 6 | 4178 | 6.0% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 100 | |
| + | 29 | 19.0% |
| × | 13 | 8.5% |
| ~ | 4 | 2.6% |
| > | 3 | 2.0% |
| < | 3 | 2.0% |
| | | 1 | 0.7% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7974 | |
| – | 98 | 1.2% |
| — | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 6683 | |
| [ | 429 | 6.0% |
| { | 1 | < 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 48 | |
| ̀ | 16 | 20.0% |
| ̧ | 16 | 20.0% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 21 | |
| © | 5 | 17.9% |
| ♂ | 2 | 7.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 6679 | |
| ] | 425 | 6.0% |
Space Separator
| Value | Count | Frequency (%) |
| 206207 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 195 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 38 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 35 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1178913 | |
| Common | 381458 | 24.4% |
| Inherited | 80 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 125859 | 10.7% |
| a | 97880 | 8.3% |
| i | 90227 | 7.7% |
| t | 77282 | 6.6% |
| n | 75398 | 6.4% |
| o | 74864 | 6.4% |
| r | 73280 | 6.2% |
| l | 65793 | 5.6% |
| s | 59459 | 5.0% |
| c | 43851 | 3.7% |
| Other values (60) | 395020 |
Common
| Value | Count | Frequency (%) |
| 206207 | ||
| . | 36042 | 9.4% |
| , | 24281 | 6.4% |
| 1 | 15281 | 4.0% |
| 9 | 9550 | 2.5% |
| ; | 8664 | 2.3% |
| - | 7974 | 2.1% |
| 2 | 7514 | 2.0% |
| 0 | 7154 | 1.9% |
| ( | 6683 | 1.8% |
| Other values (40) | 52108 | 13.7% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 48 | |
| ̀ | 16 | 20.0% |
| ̧ | 16 | 20.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1559562 | |
| None | 622 | < 0.1% |
| Punctuation | 185 | < 0.1% |
| Diacriticals | 80 | < 0.1% |
| Misc Symbols | 2 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 206207 | 13.2% | |
| e | 125859 | 8.1% |
| a | 97880 | 6.3% |
| i | 90227 | 5.8% |
| t | 77282 | 5.0% |
| n | 75398 | 4.8% |
| o | 74864 | 4.8% |
| r | 73280 | 4.7% |
| l | 65793 | 4.2% |
| s | 59459 | 3.8% |
| Other values (80) | 613313 |
None
| Value | Count | Frequency (%) |
| í | 184 | |
| é | 133 | |
| ñ | 79 | |
| á | 53 | 8.5% |
| è | 26 | 4.2% |
| ç | 25 | 4.0% |
| ó | 22 | 3.5% |
| ° | 21 | 3.4% |
| ü | 16 | 2.6% |
| × | 13 | 2.1% |
| Other values (13) | 50 | 8.0% |
Punctuation
| Value | Count | Frequency (%) |
| – | 98 | |
| ” | 38 | 20.5% |
| “ | 35 | 18.9% |
| • | 8 | 4.3% |
| … | 4 | 2.2% |
| — | 2 | 1.1% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 48 | |
| ̀ | 16 | 20.0% |
| ̧ | 16 | 20.0% |
Misc Symbols
| Value | Count | Frequency (%) |
| ♂ | 2 |
fieldNumber
Text
Missing 
| Distinct | 5 |
|---|---|
| Distinct (%) | 8.5% |
| Missing | 988343 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 9 |
| Mean length | 9.322033898 |
| Min length | 9 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 6.8% |
Sample
| 1st row | Sample OY |
|---|---|
| 2nd row | Sample OY |
| 3rd row | Sample OY |
| 4th row | Sample OY |
| 5th row | Sample OY |
| Value | Count | Frequency (%) |
| sample | 55 | |
| oy | 55 | |
| a | 2 | 1.7% |
| u.s | 1 | 0.8% |
| virgin | 1 | 0.8% |
| islands | 1 | 0.8% |
| alakai_220 | 1 | 0.8% |
| koolau_784 | 1 | 0.8% |
| koolau | 1 | 0.8% |
| 850 | 1 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 62 | |
| 61 | ||
| l | 59 | |
| S | 56 | |
| m | 55 | |
| p | 55 | |
| e | 55 | |
| O | 55 | |
| Y | 55 | |
| o | 4 | 0.7% |
| Other values (22) | 33 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 304 | |
| Uppercase Letter | 172 | |
| Space Separator | 61 | 11.1% |
| Decimal Number | 9 | 1.6% |
| Connector Punctuation | 2 | 0.4% |
| Other Punctuation | 2 | 0.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 62 | |
| l | 59 | |
| m | 55 | |
| p | 55 | |
| e | 55 | |
| o | 4 | 1.3% |
| i | 3 | 1.0% |
| u | 2 | 0.7% |
| s | 2 | 0.7% |
| n | 2 | 0.7% |
| Other values (5) | 5 | 1.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 56 | |
| O | 55 | |
| Y | 55 | |
| K | 2 | 1.2% |
| I | 1 | 0.6% |
| A | 1 | 0.6% |
| V | 1 | 0.6% |
| U | 1 | 0.6% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 8 | 2 | |
| 0 | 2 | |
| 7 | 1 | |
| 5 | 1 | |
| 4 | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 61 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 476 | |
| Common | 74 | 13.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 62 | |
| l | 59 | |
| S | 56 | |
| m | 55 | |
| p | 55 | |
| e | 55 | |
| O | 55 | |
| Y | 55 | |
| o | 4 | 0.8% |
| i | 3 | 0.6% |
| Other values (13) | 17 | 3.6% |
Common
| Value | Count | Frequency (%) |
| 61 | ||
| _ | 2 | 2.7% |
| 2 | 2 | 2.7% |
| 8 | 2 | 2.7% |
| 0 | 2 | 2.7% |
| . | 2 | 2.7% |
| 7 | 1 | 1.4% |
| 5 | 1 | 1.4% |
| 4 | 1 | 1.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 550 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 62 | |
| 61 | ||
| l | 59 | |
| S | 56 | |
| m | 55 | |
| p | 55 | |
| e | 55 | |
| O | 55 | |
| Y | 55 | |
| o | 4 | 0.7% |
| Other values (22) | 33 |
eventDate
Text
Missing 
| Distinct | 66958 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 119809 |
| Missing (%) | 12.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 10 |
| Mean length | 10.01351151 |
| Min length | 4 |
Unique
| Unique | 11342 ? |
|---|---|
| Unique (%) | 1.3% |
Sample
| 1st row | 1981-04-30 |
|---|---|
| 2nd row | 1954-08-07 |
| 3rd row | 1947-04-03 |
| 4th row | 1966-04-01 |
| 5th row | 1971-03-23 |
| Value | Count | Frequency (%) |
| 1891 | 1085 | 0.1% |
| 1923 | 918 | 0.1% |
| 1922 | 844 | 0.1% |
| 1889 | 814 | 0.1% |
| 1885 | 814 | 0.1% |
| 1892 | 772 | 0.1% |
| 1890 | 762 | 0.1% |
| 1897 | 759 | 0.1% |
| 1880 | 756 | 0.1% |
| 1875 | 745 | 0.1% |
| Other values (66948) | 860324 |
Most occurring characters
| Value | Count | Frequency (%) |
| - | 1652713 | |
| 1 | 1649641 | |
| 0 | 1325270 | |
| 9 | 1106125 | |
| 2 | 658306 | 7.6% |
| 8 | 502398 | 5.8% |
| 7 | 370750 | 4.3% |
| 6 | 370722 | 4.3% |
| 3 | 354814 | 4.1% |
| 5 | 329527 | 3.8% |
| Other values (2) | 377400 | 4.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6991922 | |
| Dash Punctuation | 1652713 | 19.0% |
| Other Punctuation | 53031 | 0.6% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1649641 | |
| 0 | 1325270 | |
| 9 | 1106125 | |
| 2 | 658306 | 9.4% |
| 8 | 502398 | 7.2% |
| 7 | 370750 | 5.3% |
| 6 | 370722 | 5.3% |
| 3 | 354814 | 5.1% |
| 5 | 329527 | 4.7% |
| 4 | 324369 | 4.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1652713 |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 53031 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8697666 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 1652713 | |
| 1 | 1649641 | |
| 0 | 1325270 | |
| 9 | 1106125 | |
| 2 | 658306 | 7.6% |
| 8 | 502398 | 5.8% |
| 7 | 370750 | 4.3% |
| 6 | 370722 | 4.3% |
| 3 | 354814 | 4.1% |
| 5 | 329527 | 3.8% |
| Other values (2) | 377400 | 4.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8697666 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| - | 1652713 | |
| 1 | 1649641 | |
| 0 | 1325270 | |
| 9 | 1106125 | |
| 2 | 658306 | 7.6% |
| 8 | 502398 | 5.8% |
| 7 | 370750 | 4.3% |
| 6 | 370722 | 4.3% |
| 3 | 354814 | 4.1% |
| 5 | 329527 | 3.8% |
| Other values (2) | 377400 | 4.3% |
startDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 261666 |
| Missing (%) | 26.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.775570496 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 120 |
|---|---|
| 2nd row | 219 |
| 3rd row | 93 |
| 4th row | 91 |
| 5th row | 82 |
| Value | Count | Frequency (%) |
| 201 | 3860 | 0.5% |
| 200 | 3710 | 0.5% |
| 196 | 3699 | 0.5% |
| 210 | 3653 | 0.5% |
| 199 | 3644 | 0.5% |
| 206 | 3635 | 0.5% |
| 209 | 3596 | 0.5% |
| 208 | 3571 | 0.5% |
| 197 | 3518 | 0.5% |
| 205 | 3509 | 0.5% |
| Other values (356) | 690341 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 402055 | |
| 1 | 395875 | |
| 3 | 232241 | |
| 5 | 148144 | 7.3% |
| 4 | 147792 | 7.3% |
| 0 | 141087 | 7.0% |
| 6 | 140313 | 7.0% |
| 9 | 139529 | 6.9% |
| 8 | 135484 | 6.7% |
| 7 | 134587 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2017107 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 402055 | |
| 1 | 395875 | |
| 3 | 232241 | |
| 5 | 148144 | 7.3% |
| 4 | 147792 | 7.3% |
| 0 | 141087 | 7.0% |
| 6 | 140313 | 7.0% |
| 9 | 139529 | 6.9% |
| 8 | 135484 | 6.7% |
| 7 | 134587 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2017107 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 402055 | |
| 1 | 395875 | |
| 3 | 232241 | |
| 5 | 148144 | 7.3% |
| 4 | 147792 | 7.3% |
| 0 | 141087 | 7.0% |
| 6 | 140313 | 7.0% |
| 9 | 139529 | 6.9% |
| 8 | 135484 | 6.7% |
| 7 | 134587 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2017107 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 402055 | |
| 1 | 395875 | |
| 3 | 232241 | |
| 5 | 148144 | 7.3% |
| 4 | 147792 | 7.3% |
| 0 | 141087 | 7.0% |
| 6 | 140313 | 7.0% |
| 9 | 139529 | 6.9% |
| 8 | 135484 | 6.7% |
| 7 | 134587 | 6.7% |
endDayOfYear
Text
Missing 
| Distinct | 366 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 261666 |
| Missing (%) | 26.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 2.776592876 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 120 |
|---|---|
| 2nd row | 219 |
| 3rd row | 93 |
| 4th row | 91 |
| 5th row | 82 |
| Value | Count | Frequency (%) |
| 201 | 3878 | 0.5% |
| 200 | 3781 | 0.5% |
| 210 | 3758 | 0.5% |
| 199 | 3668 | 0.5% |
| 206 | 3643 | 0.5% |
| 196 | 3642 | 0.5% |
| 209 | 3624 | 0.5% |
| 208 | 3616 | 0.5% |
| 197 | 3589 | 0.5% |
| 205 | 3564 | 0.5% |
| Other values (356) | 689973 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 403054 | |
| 1 | 395145 | |
| 3 | 232842 | |
| 5 | 148228 | 7.3% |
| 4 | 147942 | 7.3% |
| 0 | 141306 | 7.0% |
| 6 | 139307 | 6.9% |
| 9 | 138440 | 6.9% |
| 8 | 136018 | 6.7% |
| 7 | 135568 | 6.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2017850 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 403054 | |
| 1 | 395145 | |
| 3 | 232842 | |
| 5 | 148228 | 7.3% |
| 4 | 147942 | 7.3% |
| 0 | 141306 | 7.0% |
| 6 | 139307 | 6.9% |
| 9 | 138440 | 6.9% |
| 8 | 136018 | 6.7% |
| 7 | 135568 | 6.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2017850 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 403054 | |
| 1 | 395145 | |
| 3 | 232842 | |
| 5 | 148228 | 7.3% |
| 4 | 147942 | 7.3% |
| 0 | 141306 | 7.0% |
| 6 | 139307 | 6.9% |
| 9 | 138440 | 6.9% |
| 8 | 136018 | 6.7% |
| 7 | 135568 | 6.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2017850 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 403054 | |
| 1 | 395145 | |
| 3 | 232842 | |
| 5 | 148228 | 7.3% |
| 4 | 147942 | 7.3% |
| 0 | 141306 | 7.0% |
| 6 | 139307 | 6.9% |
| 9 | 138440 | 6.9% |
| 8 | 136018 | 6.7% |
| 7 | 135568 | 6.7% |
year
Text
Missing 
| Distinct | 250 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 122319 |
| Missing (%) | 12.4% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 1981 |
|---|---|
| 2nd row | 1954 |
| 3rd row | 1947 |
| 4th row | 1966 |
| 5th row | 1971 |
| Value | Count | Frequency (%) |
| 1966 | 11485 | 1.3% |
| 1964 | 11177 | 1.3% |
| 1939 | 10631 | 1.2% |
| 1929 | 9967 | 1.2% |
| 1949 | 9934 | 1.1% |
| 1938 | 9757 | 1.1% |
| 1965 | 9721 | 1.1% |
| 1962 | 9422 | 1.1% |
| 1922 | 9238 | 1.1% |
| 1968 | 9163 | 1.1% |
| Other values (240) | 765588 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 985189 | |
| 9 | 894393 | |
| 8 | 298773 | 8.6% |
| 0 | 225639 | 6.5% |
| 2 | 208677 | 6.0% |
| 6 | 191651 | 5.5% |
| 4 | 170693 | 4.9% |
| 3 | 166238 | 4.8% |
| 7 | 162371 | 4.7% |
| 5 | 160708 | 4.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3464332 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 985189 | |
| 9 | 894393 | |
| 8 | 298773 | 8.6% |
| 0 | 225639 | 6.5% |
| 2 | 208677 | 6.0% |
| 6 | 191651 | 5.5% |
| 4 | 170693 | 4.9% |
| 3 | 166238 | 4.8% |
| 7 | 162371 | 4.7% |
| 5 | 160708 | 4.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3464332 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 985189 | |
| 9 | 894393 | |
| 8 | 298773 | 8.6% |
| 0 | 225639 | 6.5% |
| 2 | 208677 | 6.0% |
| 6 | 191651 | 5.5% |
| 4 | 170693 | 4.9% |
| 3 | 166238 | 4.8% |
| 7 | 162371 | 4.7% |
| 5 | 160708 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3464332 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 985189 | |
| 9 | 894393 | |
| 8 | 298773 | 8.6% |
| 0 | 225639 | 6.5% |
| 2 | 208677 | 6.0% |
| 6 | 191651 | 5.5% |
| 4 | 170693 | 4.9% |
| 3 | 166238 | 4.8% |
| 7 | 162371 | 4.7% |
| 5 | 160708 | 4.6% |
month
Text
Missing 
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 181983 |
| Missing (%) | 18.4% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 1 |
| Mean length | 1.17061354 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 4 |
|---|---|
| 2nd row | 8 |
| 3rd row | 4 |
| 4th row | 4 |
| 5th row | 3 |
| Value | Count | Frequency (%) |
| 7 | 116222 | |
| 8 | 105467 | |
| 6 | 87311 | |
| 5 | 73413 | |
| 9 | 72707 | |
| 4 | 61870 | |
| 3 | 56406 | |
| 10 | 54634 | |
| 2 | 49486 | |
| 1 | 45951 | 5.7% |
| Other values (2) | 82952 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 227383 | |
| 7 | 116222 | |
| 8 | 105467 | |
| 2 | 88592 | 9.4% |
| 6 | 87311 | 9.2% |
| 5 | 73413 | 7.8% |
| 9 | 72707 | 7.7% |
| 4 | 61870 | 6.6% |
| 3 | 56406 | 6.0% |
| 0 | 54634 | 5.8% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 944005 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 227383 | |
| 7 | 116222 | |
| 8 | 105467 | |
| 2 | 88592 | 9.4% |
| 6 | 87311 | 9.2% |
| 5 | 73413 | 7.8% |
| 9 | 72707 | 7.7% |
| 4 | 61870 | 6.6% |
| 3 | 56406 | 6.0% |
| 0 | 54634 | 5.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 944005 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 227383 | |
| 7 | 116222 | |
| 8 | 105467 | |
| 2 | 88592 | 9.4% |
| 6 | 87311 | 9.2% |
| 5 | 73413 | 7.8% |
| 9 | 72707 | 7.7% |
| 4 | 61870 | 6.6% |
| 3 | 56406 | 6.0% |
| 0 | 54634 | 5.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 944005 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 227383 | |
| 7 | 116222 | |
| 8 | 105467 | |
| 2 | 88592 | 9.4% |
| 6 | 87311 | 9.2% |
| 5 | 73413 | 7.8% |
| 9 | 72707 | 7.7% |
| 4 | 61870 | 6.6% |
| 3 | 56406 | 6.0% |
| 0 | 54634 | 5.8% |
day
Text
Missing 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 314697 |
| Missing (%) | 31.8% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 1.713809457 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 30 |
|---|---|
| 2nd row | 7 |
| 3rd row | 3 |
| 4th row | 1 |
| 5th row | 23 |
| Value | Count | Frequency (%) |
| 20 | 24963 | 3.7% |
| 15 | 24514 | 3.6% |
| 18 | 23599 | 3.5% |
| 10 | 23434 | 3.5% |
| 19 | 22891 | 3.4% |
| 25 | 22886 | 3.4% |
| 17 | 22629 | 3.4% |
| 23 | 22542 | 3.3% |
| 24 | 22331 | 3.3% |
| 21 | 22292 | 3.3% |
| Other values (21) | 441624 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 303094 | |
| 2 | 288122 | |
| 3 | 97141 | 8.4% |
| 5 | 69071 | 6.0% |
| 0 | 68565 | 5.9% |
| 8 | 67525 | 5.8% |
| 7 | 66156 | 5.7% |
| 4 | 65473 | 5.7% |
| 6 | 65354 | 5.7% |
| 9 | 64101 | 5.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1154602 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 303094 | |
| 2 | 288122 | |
| 3 | 97141 | 8.4% |
| 5 | 69071 | 6.0% |
| 0 | 68565 | 5.9% |
| 8 | 67525 | 5.8% |
| 7 | 66156 | 5.7% |
| 4 | 65473 | 5.7% |
| 6 | 65354 | 5.7% |
| 9 | 64101 | 5.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1154602 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 303094 | |
| 2 | 288122 | |
| 3 | 97141 | 8.4% |
| 5 | 69071 | 6.0% |
| 0 | 68565 | 5.9% |
| 8 | 67525 | 5.8% |
| 7 | 66156 | 5.7% |
| 4 | 65473 | 5.7% |
| 6 | 65354 | 5.7% |
| 9 | 64101 | 5.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1154602 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 303094 | |
| 2 | 288122 | |
| 3 | 97141 | 8.4% |
| 5 | 69071 | 6.0% |
| 0 | 68565 | 5.9% |
| 8 | 67525 | 5.8% |
| 7 | 66156 | 5.7% |
| 4 | 65473 | 5.7% |
| 6 | 65354 | 5.7% |
| 9 | 64101 | 5.6% |
Missing 
| Distinct | 83121 |
|---|---|
| Distinct (%) | 25.0% |
| Missing | 655426 |
| Missing (%) | 66.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 69610 |
|---|---|
| Median length | 11 |
| Mean length | 13.58990137 |
| Min length | 1 |
Unique
| Unique | 34904 ? |
|---|---|
| Unique (%) | 10.5% |
Sample
| 1st row | 30 Apr 1981 |
|---|---|
| 2nd row | 16 Dec 1953 |
| 3rd row | -- --- ---- |
| 4th row | 01 Feb 1974 |
| 5th row | Transcribed d/m/y: 28/4/76 |
| Value | Count | Frequency (%) |
| 124747 | 12.0% | |
| transcribed | 35815 | 3.5% |
| d/m/y | 35815 | 3.5% |
| jul | 29191 | 2.8% |
| aug | 27927 | 2.7% |
| may | 22223 | 2.1% |
| sep | 22121 | 2.1% |
| jun | 22087 | 2.1% |
| to | 19593 | 1.9% |
| apr | 19397 | 1.9% |
| Other values (27964) | 676941 |
Most occurring characters
| Value | Count | Frequency (%) |
| 698931 | 15.4% | |
| 1 | 452053 | 10.0% |
| - | 374626 | 8.3% |
| 9 | 327456 | 7.2% |
| 2 | 199891 | 4.4% |
| 0 | 167707 | 3.7% |
| 8 | 147074 | 3.3% |
| / | 146437 | 3.2% |
| r | 129266 | 2.9% |
| e | 109856 | 2.4% |
| Other values (91) | 1771814 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1788765 | |
| Lowercase Letter | 1136371 | |
| Space Separator | 698931 | 15.4% |
| Dash Punctuation | 374626 | 8.3% |
| Uppercase Letter | 323132 | 7.1% |
| Other Punctuation | 189357 | 4.2% |
| Control | 12953 | 0.3% |
| Connector Punctuation | 644 | < 0.1% |
| Open Punctuation | 162 | < 0.1% |
| Close Punctuation | 162 | < 0.1% |
| Other values (4) | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 129266 | |
| e | 109856 | 9.7% |
| a | 108149 | 9.5% |
| u | 93732 | 8.2% |
| n | 82604 | 7.3% |
| d | 73958 | 6.5% |
| c | 72996 | 6.4% |
| y | 64680 | 5.7% |
| b | 61967 | 5.5% |
| p | 49005 | 4.3% |
| Other values (27) | 290158 |
Uppercase Letter
| Value | Count | Frequency (%) |
| J | 76067 | |
| A | 54199 | |
| M | 46856 | |
| T | 36885 | |
| S | 26921 | 8.3% |
| F | 21380 | 6.6% |
| O | 20504 | 6.3% |
| N | 17433 | 5.4% |
| D | 14938 | 4.6% |
| E | 1440 | 0.4% |
| Other values (19) | 6509 | 2.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| / | 146437 | |
| : | 36756 | 19.4% |
| , | 2937 | 1.6% |
| . | 2690 | 1.4% |
| ' | 164 | 0.1% |
| ? | 142 | 0.1% |
| ! | 112 | 0.1% |
| ; | 57 | < 0.1% |
| & | 31 | < 0.1% |
| " | 16 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 452053 | |
| 9 | 327456 | |
| 2 | 199891 | |
| 0 | 167707 | 9.4% |
| 8 | 147074 | 8.2% |
| 6 | 107431 | 6.0% |
| 3 | 104037 | 5.8% |
| 4 | 95148 | 5.3% |
| 7 | 94134 | 5.3% |
| 5 | 93834 | 5.2% |
Control
| Value | Count | Frequency (%) |
| 12895 | ||
| 58 | 0.4% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 102 | |
| [ | 60 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 102 | |
| ] | 60 |
Math Symbol
| Value | Count | Frequency (%) |
| = | 3 | |
| × | 2 |
Space Separator
| Value | Count | Frequency (%) |
| 698931 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 374626 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 644 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1 |
Other Number
| Value | Count | Frequency (%) |
| ½ | 1 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3065608 | |
| Latin | 1459503 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 129266 | 8.9% |
| e | 109856 | 7.5% |
| a | 108149 | 7.4% |
| u | 93732 | 6.4% |
| n | 82604 | 5.7% |
| J | 76067 | 5.2% |
| d | 73958 | 5.1% |
| c | 72996 | 5.0% |
| y | 64680 | 4.4% |
| b | 61967 | 4.2% |
| Other values (56) | 586228 |
Common
| Value | Count | Frequency (%) |
| 698931 | ||
| 1 | 452053 | |
| - | 374626 | |
| 9 | 327456 | |
| 2 | 199891 | 6.5% |
| 0 | 167707 | 5.5% |
| 8 | 147074 | 4.8% |
| / | 146437 | 4.8% |
| 6 | 107431 | 3.5% |
| 3 | 104037 | 3.4% |
| Other values (25) | 339965 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4525057 | |
| None | 53 | < 0.1% |
| Punctuation | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 698931 | 15.4% | |
| 1 | 452053 | 10.0% |
| - | 374626 | 8.3% |
| 9 | 327456 | 7.2% |
| 2 | 199891 | 4.4% |
| 0 | 167707 | 3.7% |
| 8 | 147074 | 3.3% |
| / | 146437 | 3.2% |
| r | 129266 | 2.9% |
| e | 109856 | 2.4% |
| Other values (73) | 1771760 |
None
| Value | Count | Frequency (%) |
| é | 15 | |
| í | 6 | 11.3% |
| ó | 6 | 11.3% |
| á | 5 | 9.4% |
| ô | 3 | 5.7% |
| û | 3 | 5.7% |
| Æ | 3 | 5.7% |
| ü | 2 | 3.8% |
| × | 2 | 3.8% |
| ° | 1 | 1.9% |
| Other values (7) | 7 |
Punctuation
| Value | Count | Frequency (%) |
| ” | 1 |
habitat
Text
Missing 
| Distinct | 54569 |
|---|---|
| Distinct (%) | 49.4% |
| Missing | 877971 |
| Missing (%) | 88.8% |
| Memory size | 7.5 MiB |
Length
| Max length | 567 |
|---|---|
| Median length | 292 |
| Mean length | 33.5132979 |
| Min length | 1 |
Unique
| Unique | 43682 ? |
|---|---|
| Unique (%) | 39.6% |
Sample
| 1st row | Erect. |
|---|---|
| 2nd row | Planted |
| 3rd row | Hillsides covered with broad-leaved forest, understory with Arthrostylidium, Rubus, and numerous ferns, epiphytes and Usnea. |
| 4th row | Open to closed forest with Pinus contorta, Populus tremuloides, Purshia tridentata, and Ribes cereum. |
| 5th row | Deep secondary forest; clay soil |
| Value | Count | Frequency (%) |
| forest | 28539 | 5.0% |
| on | 19755 | 3.5% |
| and | 16182 | 2.8% |
| in | 14715 | 2.6% |
| with | 11801 | 2.1% |
| of | 10905 | 1.9% |
| along | 6428 | 1.1% |
| de | 6077 | 1.1% |
| soil | 5416 | 1.0% |
| sand | 4830 | 0.8% |
| Other values (19599) | 444497 |
Most occurring characters
| Value | Count | Frequency (%) |
| 458714 | ||
| e | 334644 | 9.0% |
| a | 293407 | 7.9% |
| o | 267034 | 7.2% |
| r | 232740 | 6.3% |
| s | 232487 | 6.3% |
| n | 229134 | 6.2% |
| i | 195932 | 5.3% |
| t | 185312 | 5.0% |
| l | 144970 | 3.9% |
| Other values (119) | 1126533 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2956785 | |
| Space Separator | 458714 | 12.4% |
| Uppercase Letter | 161024 | 4.4% |
| Other Punctuation | 101816 | 2.8% |
| Decimal Number | 9235 | 0.2% |
| Dash Punctuation | 8003 | 0.2% |
| Close Punctuation | 2295 | 0.1% |
| Open Punctuation | 2276 | 0.1% |
| Math Symbol | 699 | < 0.1% |
| Other Symbol | 30 | < 0.1% |
| Other values (6) | 30 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 334644 | |
| a | 293407 | |
| o | 267034 | 9.0% |
| r | 232740 | 7.9% |
| s | 232487 | 7.9% |
| n | 229134 | 7.7% |
| i | 195932 | 6.6% |
| t | 185312 | 6.3% |
| l | 144970 | 4.9% |
| d | 142531 | 4.8% |
| Other values (41) | 698594 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 19589 | |
| M | 14835 | 9.2% |
| C | 11216 | 7.0% |
| P | 10463 | 6.5% |
| O | 10200 | 6.3% |
| A | 9926 | 6.2% |
| R | 9817 | 6.1% |
| D | 9128 | 5.7% |
| B | 8873 | 5.5% |
| F | 8509 | 5.3% |
| Other values (19) | 48468 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 45008 | |
| . | 43899 | |
| ; | 6228 | 6.1% |
| & | 2296 | 2.3% |
| : | 1271 | 1.2% |
| / | 1232 | 1.2% |
| " | 931 | 0.9% |
| ' | 519 | 0.5% |
| % | 153 | 0.2% |
| ? | 145 | 0.1% |
| Other values (6) | 134 | 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2807 | |
| 3 | 1186 | |
| 1 | 1155 | |
| 5 | 1136 | |
| 2 | 1092 | 11.8% |
| 4 | 740 | 8.0% |
| 6 | 387 | 4.2% |
| 8 | 304 | 3.3% |
| 7 | 223 | 2.4% |
| 9 | 205 | 2.2% |
Math Symbol
| Value | Count | Frequency (%) |
| ~ | 423 | |
| | | 121 | 17.3% |
| + | 82 | 11.7% |
| ± | 50 | 7.2% |
| = | 14 | 2.0% |
| > | 6 | 0.9% |
| < | 3 | 0.4% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2189 | |
| ] | 69 | 3.0% |
| } | 37 | 1.6% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2176 | |
| [ | 63 | 2.8% |
| { | 37 | 1.6% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7994 | |
| – | 9 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 458714 |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 30 |
Final Punctuation
| Value | Count | Frequency (%) |
| ” | 10 |
Initial Punctuation
| Value | Count | Frequency (%) |
| “ | 7 |
Other Letter
| Value | Count | Frequency (%) |
| º | 5 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Other Number
| Value | Count | Frequency (%) |
| ² | 2 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3117814 | |
| Common | 583093 | 15.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 334644 | 10.7% |
| a | 293407 | 9.4% |
| o | 267034 | 8.6% |
| r | 232740 | 7.5% |
| s | 232487 | 7.5% |
| n | 229134 | 7.3% |
| i | 195932 | 6.3% |
| t | 185312 | 5.9% |
| l | 144970 | 4.6% |
| d | 142531 | 4.6% |
| Other values (71) | 859623 |
Common
| Value | Count | Frequency (%) |
| 458714 | ||
| , | 45008 | 7.7% |
| . | 43899 | 7.5% |
| - | 7994 | 1.4% |
| ; | 6228 | 1.1% |
| 0 | 2807 | 0.5% |
| & | 2296 | 0.4% |
| ) | 2189 | 0.4% |
| ( | 2176 | 0.4% |
| : | 1271 | 0.2% |
| Other values (38) | 10511 | 1.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3694009 | |
| None | 6846 | 0.2% |
| Punctuation | 52 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 458714 | ||
| e | 334644 | 9.1% |
| a | 293407 | 7.9% |
| o | 267034 | 7.2% |
| r | 232740 | 6.3% |
| s | 232487 | 6.3% |
| n | 229134 | 6.2% |
| i | 195932 | 5.3% |
| t | 185312 | 5.0% |
| l | 144970 | 3.9% |
| Other values (82) | 1119635 |
None
| Value | Count | Frequency (%) |
| ú | 1030 | |
| ê | 1022 | |
| é | 989 | |
| ó | 939 | |
| í | 817 | |
| á | 696 | |
| ñ | 546 | |
| è | 322 | 4.7% |
| à | 132 | 1.9% |
| ã | 56 | 0.8% |
| Other values (23) | 297 | 4.3% |
Punctuation
| Value | Count | Frequency (%) |
| … | 26 | |
| ” | 10 | 19.2% |
| – | 9 | 17.3% |
| “ | 7 | 13.5% |
locationID
Text
Missing 
| Distinct | 667 |
|---|---|
| Distinct (%) | 7.4% |
| Missing | 979422 |
| Missing (%) | 99.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 35 |
|---|---|
| Median length | 5 |
| Mean length | 6.010690423 |
| Min length | 1 |
Unique
| Unique | 202 ? |
|---|---|
| Unique (%) | 2.2% |
Sample
| 1st row | 66-10 |
|---|---|
| 2nd row | 69-11 |
| 3rd row | 64-51 |
| 4th row | 66-14 |
| 5th row | 64-34 |
| Value | Count | Frequency (%) |
| station | 1070 | 10.3% |
| ms04 | 374 | 3.6% |
| 66-24 | 305 | 2.9% |
| 61 | 200 | 1.9% |
| 64-47 | 131 | 1.3% |
| 64-48 | 130 | 1.3% |
| 69-14 | 124 | 1.2% |
| 64-45 | 98 | 0.9% |
| 66-28 | 92 | 0.9% |
| 64-06 | 90 | 0.9% |
| Other values (654) | 7783 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 9454 | |
| - | 7849 | |
| 4 | 4641 | 8.6% |
| 2 | 4263 | 7.9% |
| 1 | 3970 | 7.4% |
| 0 | 3323 | 6.2% |
| 3 | 2445 | 4.5% |
| 7 | 2280 | 4.2% |
| t | 2194 | 4.1% |
| S | 1651 | 3.1% |
| Other values (55) | 11906 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 34512 | |
| Dash Punctuation | 7849 | 14.5% |
| Lowercase Letter | 6872 | 12.7% |
| Uppercase Letter | 3196 | 5.9% |
| Space Separator | 1417 | 2.6% |
| Connector Punctuation | 69 | 0.1% |
| Close Punctuation | 26 | < 0.1% |
| Open Punctuation | 26 | < 0.1% |
| Other Punctuation | 9 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1651 | |
| M | 410 | 12.8% |
| A | 247 | 7.7% |
| I | 170 | 5.3% |
| K | 150 | 4.7% |
| N | 107 | 3.3% |
| T | 74 | 2.3% |
| H | 62 | 1.9% |
| O | 51 | 1.6% |
| B | 40 | 1.3% |
| Other values (15) | 234 | 7.3% |
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 2194 | |
| n | 1126 | |
| o | 1120 | |
| i | 1107 | |
| a | 1103 | |
| e | 56 | 0.8% |
| r | 27 | 0.4% |
| l | 25 | 0.4% |
| s | 21 | 0.3% |
| d | 20 | 0.3% |
| Other values (10) | 73 | 1.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 9454 | |
| 4 | 4641 | |
| 2 | 4263 | |
| 1 | 3970 | |
| 0 | 3323 | 9.6% |
| 3 | 2445 | 7.1% |
| 7 | 2280 | 6.6% |
| 8 | 1650 | 4.8% |
| 9 | 1276 | 3.7% |
| 5 | 1210 | 3.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 6 | |
| / | 2 | 22.2% |
| & | 1 | 11.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 25 | |
| ] | 1 | 3.8% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 25 | |
| [ | 1 | 3.8% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 7849 |
Space Separator
| Value | Count | Frequency (%) |
| 1417 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 69 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 43908 | |
| Latin | 10068 | 18.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 2194 | |
| S | 1651 | |
| n | 1126 | |
| o | 1120 | |
| i | 1107 | |
| a | 1103 | |
| M | 410 | 4.1% |
| A | 247 | 2.5% |
| I | 170 | 1.7% |
| K | 150 | 1.5% |
| Other values (35) | 790 | 7.8% |
Common
| Value | Count | Frequency (%) |
| 6 | 9454 | |
| - | 7849 | |
| 4 | 4641 | |
| 2 | 4263 | |
| 1 | 3970 | |
| 0 | 3323 | 7.6% |
| 3 | 2445 | 5.6% |
| 7 | 2280 | 5.2% |
| 8 | 1650 | 3.8% |
| 1417 | 3.2% | |
| Other values (10) | 2616 | 6.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 53976 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 9454 | |
| - | 7849 | |
| 4 | 4641 | 8.6% |
| 2 | 4263 | 7.9% |
| 1 | 3970 | 7.4% |
| 0 | 3323 | 6.2% |
| 3 | 2445 | 4.5% |
| 7 | 2280 | 4.2% |
| t | 2194 | 4.1% |
| S | 1651 | 3.1% |
| Other values (55) | 11906 |
higherGeography
Text
| Distinct | 17498 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 8448 |
| Missing (%) | 0.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 133 |
|---|---|
| Median length | 113 |
| Mean length | 40.94448923 |
| Min length | 5 |
Unique
| Unique | 6237 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | North America, United States, Florida |
|---|---|
| 2nd row | South America - Neotropics, Peru, Piura |
| 3rd row | South America, Argentina, Formosa |
| 4th row | South America - Neotropics, Venezuela, Carabobo |
| 5th row | Africa, South Africa |
| Value | Count | Frequency (%) |
| america | 664608 | 12.5% |
| north | 382460 | 7.2% |
| 365184 | 6.8% | |
| neotropics | 351203 | 6.6% |
| united | 295755 | 5.5% |
| states | 293830 | 5.5% |
| south | 254482 | 4.8% |
| mexico | 71903 | 1.3% |
| asia-tropical | 66600 | 1.2% |
| brazil | 65997 | 1.2% |
| Other values (10459) | 2522178 |
Most occurring characters
| Value | Count | Frequency (%) |
| 4354246 | 10.9% | |
| a | 3701367 | 9.2% |
| i | 2990303 | 7.5% |
| e | 2937990 | 7.3% |
| r | 2542856 | 6.3% |
| t | 2505817 | 6.2% |
| o | 2445015 | 6.1% |
| , | 2049394 | 5.1% |
| n | 1579289 | 3.9% |
| c | 1556529 | 3.9% |
| Other values (124) | 13460910 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 28096131 | |
| Uppercase Letter | 5068430 | 12.6% |
| Space Separator | 4354246 | 10.9% |
| Other Punctuation | 2092855 | 5.2% |
| Dash Punctuation | 493187 | 1.2% |
| Open Punctuation | 9360 | < 0.1% |
| Close Punctuation | 9360 | < 0.1% |
| Modifier Letter | 104 | < 0.1% |
| Modifier Symbol | 25 | < 0.1% |
| Decimal Number | 17 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3701367 | |
| i | 2990303 | |
| e | 2937990 | |
| r | 2542856 | |
| t | 2505817 | |
| o | 2445015 | |
| n | 1579289 | 5.6% |
| c | 1556529 | 5.5% |
| s | 1514118 | 5.4% |
| m | 951564 | 3.4% |
| Other values (58) | 5371283 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 996984 | |
| N | 827492 | |
| S | 695744 | |
| C | 398218 | 7.9% |
| U | 330028 | 6.5% |
| M | 216529 | 4.3% |
| P | 200273 | 4.0% |
| I | 183128 | 3.6% |
| T | 183052 | 3.6% |
| B | 152770 | 3.0% |
| Other values (33) | 884212 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2049394 | |
| . | 29432 | 1.4% |
| ' | 8898 | 0.4% |
| / | 4747 | 0.2% |
| ? | 346 | < 0.1% |
| & | 33 | < 0.1% |
| ; | 2 | < 0.1% |
| ¡ | 2 | < 0.1% |
| \ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 7 | |
| 9 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5733 | |
| [ | 3627 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5733 | |
| ] | 3627 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 91 | |
| ʼ | 13 | 12.5% |
Space Separator
| Value | Count | Frequency (%) |
| 4354246 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 493187 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 25 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 33164561 | |
| Common | 6959154 | 17.3% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3701367 | 11.2% |
| i | 2990303 | 9.0% |
| e | 2937990 | 8.9% |
| r | 2542856 | 7.7% |
| t | 2505817 | 7.6% |
| o | 2445015 | 7.4% |
| n | 1579289 | 4.8% |
| c | 1556529 | 4.7% |
| s | 1514118 | 4.6% |
| A | 996984 | 3.0% |
| Other values (101) | 10394293 |
Common
| Value | Count | Frequency (%) |
| 4354246 | ||
| , | 2049394 | |
| - | 493187 | 7.1% |
| . | 29432 | 0.4% |
| ' | 8898 | 0.1% |
| ( | 5733 | 0.1% |
| ) | 5733 | 0.1% |
| / | 4747 | 0.1% |
| [ | 3627 | 0.1% |
| ] | 3627 | 0.1% |
| Other values (12) | 530 | < 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 40018791 | |
| None | 104815 | 0.3% |
| Modifier Letters | 104 | < 0.1% |
| Latin Ext Additional | 5 | < 0.1% |
| Diacriticals | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 4354246 | 10.9% | |
| a | 3701367 | 9.2% |
| i | 2990303 | 7.5% |
| e | 2937990 | 7.3% |
| r | 2542856 | 6.4% |
| t | 2505817 | 6.3% |
| o | 2445015 | 6.1% |
| , | 2049394 | 5.1% |
| n | 1579289 | 3.9% |
| c | 1556529 | 3.9% |
| Other values (60) | 13355985 |
None
| Value | Count | Frequency (%) |
| á | 34764 | |
| í | 19904 | |
| é | 17687 | |
| ó | 12612 | 12.0% |
| ã | 6382 | 6.1% |
| ô | 3029 | 2.9% |
| ç | 1904 | 1.8% |
| ñ | 1606 | 1.5% |
| Î | 1479 | 1.4% |
| ü | 1115 | 1.1% |
| Other values (49) | 4333 | 4.1% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 91 | |
| ʼ | 13 | 12.5% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṭ | 4 | |
| ồ | 1 | 20.0% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 1 |
continent
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 32788 |
| Missing (%) | 3.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.06743099 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SOUTH_AMERICA |
|---|---|
| 2nd row | SOUTH_AMERICA |
| 3rd row | SOUTH_AMERICA |
| 4th row | AFRICA |
| 5th row | SOUTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 482446 | |
| south_america | 235730 | |
| asia | 113249 | 11.9% |
| europe | 50324 | 5.3% |
| oceania | 37414 | 3.9% |
| africa | 35361 | 3.7% |
| antarctica | 1090 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 1811670 | |
| R | 1287397 | |
| I | 905290 | |
| E | 856238 | |
| O | 805914 | |
| C | 793131 | |
| T | 720356 | 6.8% |
| H | 718176 | 6.8% |
| _ | 718176 | 6.8% |
| M | 718176 | 6.8% |
| Other values (5) | 1241668 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 9858016 | |
| Connector Punctuation | 718176 | 6.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 1811670 | |
| R | 1287397 | |
| I | 905290 | |
| E | 856238 | |
| O | 805914 | |
| C | 793131 | |
| T | 720356 | 7.3% |
| H | 718176 | 7.3% |
| M | 718176 | 7.3% |
| N | 520950 | 5.3% |
| Other values (4) | 720718 | 7.3% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 718176 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9858016 | |
| Common | 718176 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 1811670 | |
| R | 1287397 | |
| I | 905290 | |
| E | 856238 | |
| O | 805914 | |
| C | 793131 | |
| T | 720356 | 7.3% |
| H | 718176 | 7.3% |
| M | 718176 | 7.3% |
| N | 520950 | 5.3% |
| Other values (4) | 720718 | 7.3% |
Common
| Value | Count | Frequency (%) |
| _ | 718176 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10576192 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 1811670 | |
| R | 1287397 | |
| I | 905290 | |
| E | 856238 | |
| O | 805914 | |
| C | 793131 | |
| T | 720356 | 6.8% |
| H | 718176 | 6.8% |
| _ | 718176 | 6.8% |
| M | 718176 | 6.8% |
| Other values (5) | 1241668 |
waterBody
Text
Missing 
| Distinct | 75 |
|---|---|
| Distinct (%) | 1.8% |
| Missing | 984227 |
| Missing (%) | 99.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 62 |
|---|---|
| Median length | 61 |
| Mean length | 25.99209581 |
| Min length | 8 |
Unique
| Unique | 25 ? |
|---|---|
| Unique (%) | 0.6% |
Sample
| 1st row | North Atlantic Ocean, Bay of Fundy |
|---|---|
| 2nd row | North Atlantic Ocean, Caribbean Sea |
| 3rd row | North Atlantic Ocean, Gulf of Maine, Englishman Bay/Mack Cove |
| 4th row | North Atlantic Ocean, Caribbean Sea |
| 5th row | North Pacific Ocean |
| Value | Count | Frequency (%) |
| ocean | 3353 | |
| north | 3226 | |
| atlantic | 3034 | |
| sea | 1523 | |
| caribbean | 1284 | 7.6% |
| of | 757 | 4.5% |
| gulf | 720 | 4.3% |
| maine | 576 | 3.4% |
| bay | 526 | 3.1% |
| pacific | 275 | 1.6% |
| Other values (74) | 1519 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 12644 | |
| 12618 | ||
| t | 9811 | 9.0% |
| n | 8916 | 8.2% |
| e | 7679 | 7.1% |
| c | 7268 | 6.7% |
| i | 5876 | 5.4% |
| r | 4960 | 4.6% |
| o | 4857 | 4.5% |
| l | 4068 | 3.7% |
| Other values (42) | 29820 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 77476 | |
| Uppercase Letter | 16118 | 14.9% |
| Space Separator | 12618 | 11.6% |
| Other Punctuation | 2214 | 2.0% |
| Modifier Letter | 91 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 12644 | |
| t | 9811 | |
| n | 8916 | |
| e | 7679 | |
| c | 7268 | |
| i | 5876 | |
| r | 4960 | 6.4% |
| o | 4857 | 6.3% |
| l | 4068 | 5.3% |
| h | 3689 | 4.8% |
| Other values (16) | 7708 |
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 3400 | |
| N | 3226 | |
| A | 3059 | |
| S | 1750 | |
| C | 1606 | |
| G | 802 | 5.0% |
| B | 681 | 4.2% |
| M | 657 | 4.1% |
| P | 418 | 2.6% |
| I | 123 | 0.8% |
| Other values (11) | 396 | 2.5% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2113 | |
| / | 83 | 3.7% |
| ' | 18 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 12618 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 91 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 93594 | |
| Common | 14923 | 13.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 12644 | |
| t | 9811 | |
| n | 8916 | 9.5% |
| e | 7679 | 8.2% |
| c | 7268 | 7.8% |
| i | 5876 | 6.3% |
| r | 4960 | 5.3% |
| o | 4857 | 5.2% |
| l | 4068 | 4.3% |
| h | 3689 | 3.9% |
| Other values (37) | 23826 |
Common
| Value | Count | Frequency (%) |
| 12618 | ||
| , | 2113 | 14.2% |
| ʻ | 91 | 0.6% |
| / | 83 | 0.6% |
| ' | 18 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 108335 | |
| Modifier Letters | 91 | 0.1% |
| None | 91 | 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 12644 | |
| 12618 | ||
| t | 9811 | 9.1% |
| n | 8916 | 8.2% |
| e | 7679 | 7.1% |
| c | 7268 | 6.7% |
| i | 5876 | 5.4% |
| r | 4960 | 4.6% |
| o | 4857 | 4.5% |
| l | 4068 | 3.8% |
| Other values (40) | 29638 |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 91 |
None
| Value | Count | Frequency (%) |
| ā | 91 |
islandGroup
Text
Missing 
| Distinct | 362 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 963568 |
| Missing (%) | 97.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 39 |
| Mean length | 14.85515825 |
| Min length | 5 |
Unique
| Unique | 85 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Greater Antilles |
|---|---|
| 2nd row | Greater Antilles |
| 3rd row | Elizabeth Islands |
| 4th row | Channel Islands |
| 5th row | Greater Antilles |
| Value | Count | Frequency (%) |
| greater | 7095 | 12.5% |
| antilles | 7095 | 12.5% |
| islands | 5085 | 9.0% |
| is | 4355 | 7.7% |
| group | 3620 | 6.4% |
| new | 1627 | 2.9% |
| guinea | 1329 | 2.3% |
| keys | 1172 | 2.1% |
| channel | 1169 | 2.1% |
| florida | 1110 | 2.0% |
| Other values (325) | 23144 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 37198 | 10.1% |
| a | 33600 | 9.1% |
| 31967 | 8.7% | |
| s | 29203 | 7.9% |
| l | 28135 | 7.6% |
| r | 26374 | 7.1% |
| n | 24868 | 6.7% |
| t | 19411 | 5.3% |
| i | 18406 | 5.0% |
| G | 13341 | 3.6% |
| Other values (52) | 106410 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 275978 | |
| Uppercase Letter | 55169 | 15.0% |
| Space Separator | 31967 | 8.7% |
| Other Punctuation | 4509 | 1.2% |
| Open Punctuation | 643 | 0.2% |
| Close Punctuation | 643 | 0.2% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 37198 | |
| a | 33600 | |
| s | 29203 | |
| l | 28135 | |
| r | 26374 | |
| n | 24868 | |
| t | 19411 | |
| i | 18406 | |
| u | 11144 | 4.0% |
| d | 10677 | 3.9% |
| Other values (17) | 36962 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 13341 | |
| I | 10292 | |
| A | 8068 | |
| C | 3484 | 6.3% |
| V | 3078 | 5.6% |
| L | 2538 | 4.6% |
| N | 2119 | 3.8% |
| S | 1813 | 3.3% |
| K | 1439 | 2.6% |
| F | 1319 | 2.4% |
| Other values (15) | 7678 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4348 | |
| ' | 155 | 3.4% |
| , | 4 | 0.1% |
| ? | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 411 | |
| ( | 232 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 411 | |
| ) | 232 |
Space Separator
| Value | Count | Frequency (%) |
| 31967 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 331147 | |
| Common | 37766 | 10.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 37198 | |
| a | 33600 | 10.1% |
| s | 29203 | 8.8% |
| l | 28135 | 8.5% |
| r | 26374 | 8.0% |
| n | 24868 | 7.5% |
| t | 19411 | 5.9% |
| i | 18406 | 5.6% |
| G | 13341 | 4.0% |
| u | 11144 | 3.4% |
| Other values (42) | 89467 |
Common
| Value | Count | Frequency (%) |
| 31967 | ||
| . | 4348 | 11.5% |
| [ | 411 | 1.1% |
| ] | 411 | 1.1% |
| ( | 232 | 0.6% |
| ) | 232 | 0.6% |
| ' | 155 | 0.4% |
| , | 4 | < 0.1% |
| - | 4 | < 0.1% |
| ? | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 366915 | |
| None | 1998 | 0.5% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 37198 | 10.1% |
| a | 33600 | 9.2% |
| 31967 | 8.7% | |
| s | 29203 | 8.0% |
| l | 28135 | 7.7% |
| r | 26374 | 7.2% |
| n | 24868 | 6.8% |
| t | 19411 | 5.3% |
| i | 18406 | 5.0% |
| G | 13341 | 3.6% |
| Other values (50) | 104412 |
None
| Value | Count | Frequency (%) |
| Î | 1085 | |
| á | 913 |
island
Text
Missing 
| Distinct | 2614 |
|---|---|
| Distinct (%) | 3.2% |
| Missing | 906001 |
| Missing (%) | 91.7% |
| Memory size | 7.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 43 |
| Mean length | 9.546267642 |
| Min length | 2 |
Unique
| Unique | 943 ? |
|---|---|
| Unique (%) | 1.1% |
Sample
| 1st row | Rota |
|---|---|
| 2nd row | Hispaniola |
| 3rd row | North Island |
| 4th row | Kaua'i |
| 5th row | Hispaniola Island |
| Value | Count | Frequency (%) |
| hispaniola | 10778 | 8.5% |
| island | 9771 | 7.7% |
| cuba | 4961 | 3.9% |
| oahu | 3726 | 2.9% |
| st | 2657 | 2.1% |
| kaua'i | 2655 | 2.1% |
| new | 2291 | 1.8% |
| jamaica | 2258 | 1.8% |
| isla | 2167 | 1.7% |
| luzon | 2129 | 1.7% |
| Other values (2138) | 83021 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 125254 | |
| i | 61933 | 7.9% |
| n | 52763 | 6.7% |
| o | 47475 | 6.0% |
| 44013 | 5.6% | |
| l | 41457 | 5.3% |
| u | 38087 | 4.8% |
| e | 37522 | 4.8% |
| s | 35293 | 4.5% |
| r | 27973 | 3.6% |
| Other values (66) | 274852 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 602963 | |
| Uppercase Letter | 123048 | 15.6% |
| Space Separator | 44013 | 5.6% |
| Other Punctuation | 9305 | 1.2% |
| Open Punctuation | 3492 | 0.4% |
| Close Punctuation | 3492 | 0.4% |
| Dash Punctuation | 309 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 125254 | |
| i | 61933 | |
| n | 52763 | |
| o | 47475 | 7.9% |
| l | 41457 | 6.9% |
| u | 38087 | 6.3% |
| e | 37522 | 6.2% |
| s | 35293 | 5.9% |
| r | 27973 | 4.6% |
| t | 23875 | 4.0% |
| Other values (28) | 111331 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 16106 | |
| I | 14801 | |
| C | 13357 | 10.9% |
| S | 9515 | 7.7% |
| M | 7530 | 6.1% |
| B | 5675 | 4.6% |
| T | 5620 | 4.6% |
| G | 5558 | 4.5% |
| K | 5090 | 4.1% |
| O | 5019 | 4.1% |
| Other values (17) | 34777 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 4593 | |
| . | 4379 | |
| , | 297 | 3.2% |
| ? | 32 | 0.3% |
| / | 4 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2963 | |
| ( | 529 | 15.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2963 | |
| ) | 529 | 15.1% |
Space Separator
| Value | Count | Frequency (%) |
| 44013 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 309 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 726011 | |
| Common | 60611 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 125254 | |
| i | 61933 | 8.5% |
| n | 52763 | 7.3% |
| o | 47475 | 6.5% |
| l | 41457 | 5.7% |
| u | 38087 | 5.2% |
| e | 37522 | 5.2% |
| s | 35293 | 4.9% |
| r | 27973 | 3.9% |
| t | 23875 | 3.3% |
| Other values (55) | 234379 |
Common
| Value | Count | Frequency (%) |
| 44013 | ||
| ' | 4593 | 7.6% |
| . | 4379 | 7.2% |
| [ | 2963 | 4.9% |
| ] | 2963 | 4.9% |
| ) | 529 | 0.9% |
| ( | 529 | 0.9% |
| - | 309 | 0.5% |
| , | 297 | 0.5% |
| ? | 32 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 785195 | |
| None | 1427 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 125254 | |
| i | 61933 | 7.9% |
| n | 52763 | 6.7% |
| o | 47475 | 6.0% |
| 44013 | 5.6% | |
| l | 41457 | 5.3% |
| u | 38087 | 4.9% |
| e | 37522 | 4.8% |
| s | 35293 | 4.5% |
| r | 27973 | 3.6% |
| Other values (52) | 273425 |
None
| Value | Count | Frequency (%) |
| ç | 423 | |
| Î | 320 | |
| é | 214 | |
| ó | 162 | 11.4% |
| á | 116 | 8.1% |
| â | 72 | 5.0% |
| ñ | 57 | 4.0% |
| ã | 36 | 2.5% |
| í | 9 | 0.6% |
| Ö | 7 | 0.5% |
| Other values (4) | 11 | 0.8% |
countryCode
Text
Missing 
| Distinct | 233 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10855 |
| Missing (%) | 1.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | US |
|---|---|
| 2nd row | PE |
| 3rd row | AR |
| 4th row | VE |
| 5th row | ZA |
| Value | Count | Frequency (%) |
| us | 291222 | |
| br | 65995 | 6.8% |
| mx | 63561 | 6.5% |
| co | 36051 | 3.7% |
| ve | 26234 | 2.7% |
| pe | 25485 | 2.6% |
| ca | 24554 | 2.5% |
| cn | 23614 | 2.4% |
| ec | 19520 | 2.0% |
| ph | 18818 | 1.9% |
| Other values (223) | 382493 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 323622 | |
| S | 314243 | |
| C | 149828 | 7.7% |
| R | 124243 | 6.4% |
| P | 103814 | 5.3% |
| B | 95778 | 4.9% |
| M | 95536 | 4.9% |
| E | 89602 | 4.6% |
| A | 79231 | 4.1% |
| X | 63569 | 3.3% |
| Other values (16) | 515628 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1955094 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 323622 | |
| S | 314243 | |
| C | 149828 | 7.7% |
| R | 124243 | 6.4% |
| P | 103814 | 5.3% |
| B | 95778 | 4.9% |
| M | 95536 | 4.9% |
| E | 89602 | 4.6% |
| A | 79231 | 4.1% |
| X | 63569 | 3.3% |
| Other values (16) | 515628 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1955094 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 323622 | |
| S | 314243 | |
| C | 149828 | 7.7% |
| R | 124243 | 6.4% |
| P | 103814 | 5.3% |
| B | 95778 | 4.9% |
| M | 95536 | 4.9% |
| E | 89602 | 4.6% |
| A | 79231 | 4.1% |
| X | 63569 | 3.3% |
| Other values (16) | 515628 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1955094 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 323622 | |
| S | 314243 | |
| C | 149828 | 7.7% |
| R | 124243 | 6.4% |
| P | 103814 | 5.3% |
| B | 95778 | 4.9% |
| M | 95536 | 4.9% |
| E | 89602 | 4.6% |
| A | 79231 | 4.1% |
| X | 63569 | 3.3% |
| Other values (16) | 515628 |
stateProvince
Text
Missing 
| Distinct | 3164 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 219376 |
| Missing (%) | 22.2% |
| Memory size | 7.5 MiB |
Length
| Max length | 52 |
|---|---|
| Median length | 49 |
| Mean length | 9.001383568 |
| Min length | 1 |
Unique
| Unique | 709 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | Florida |
|---|---|
| 2nd row | Piura |
| 3rd row | Formosa |
| 4th row | Carabobo |
| 5th row | Manabí |
| Value | Count | Frequency (%) |
| california | 44326 | 4.4% |
| new | 23059 | 2.3% |
| florida | 19421 | 1.9% |
| virginia | 15940 | 1.6% |
| texas | 15589 | 1.5% |
| alaska | 14760 | 1.5% |
| amazonas | 13297 | 1.3% |
| hawaii | 12078 | 1.2% |
| arizona | 11151 | 1.1% |
| san | 11038 | 1.1% |
| Other values (2927) | 831105 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1075650 | |
| i | 567474 | 8.2% |
| n | 508752 | 7.3% |
| o | 506284 | 7.3% |
| r | 439940 | 6.4% |
| e | 348295 | 5.0% |
| s | 278225 | 4.0% |
| l | 274188 | 4.0% |
| t | 243136 | 3.5% |
| 242738 | 3.5% | |
| Other values (109) | 2437616 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5615213 | |
| Uppercase Letter | 1010188 | 14.6% |
| Space Separator | 242738 | 3.5% |
| Dash Punctuation | 25671 | 0.4% |
| Other Punctuation | 18649 | 0.3% |
| Open Punctuation | 4913 | 0.1% |
| Close Punctuation | 4913 | 0.1% |
| Modifier Letter | 13 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1075650 | |
| i | 567474 | |
| n | 508752 | |
| o | 506284 | |
| r | 439940 | 7.8% |
| e | 348295 | 6.2% |
| s | 278225 | 5.0% |
| l | 274188 | 4.9% |
| t | 243136 | 4.3% |
| u | 231162 | 4.1% |
| Other values (56) | 1142107 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 156651 | |
| M | 98977 | 9.8% |
| S | 83880 | 8.3% |
| A | 79458 | 7.9% |
| N | 66806 | 6.6% |
| P | 56759 | 5.6% |
| T | 43026 | 4.3% |
| B | 39393 | 3.9% |
| V | 39298 | 3.9% |
| L | 34505 | 3.4% |
| Other values (30) | 311435 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 13890 | |
| / | 2984 | 16.0% |
| , | 834 | 4.5% |
| ' | 765 | 4.1% |
| ? | 147 | 0.8% |
| & | 29 | 0.2% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4701 | |
| [ | 212 | 4.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4701 | |
| ] | 212 | 4.3% |
Space Separator
| Value | Count | Frequency (%) |
| 242738 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 25671 |
Modifier Letter
| Value | Count | Frequency (%) |
| ʼ | 13 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6625401 | |
| Common | 296897 | 4.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1075650 | |
| i | 567474 | 8.6% |
| n | 508752 | 7.7% |
| o | 506284 | 7.6% |
| r | 439940 | 6.6% |
| e | 348295 | 5.3% |
| s | 278225 | 4.2% |
| l | 274188 | 4.1% |
| t | 243136 | 3.7% |
| u | 231162 | 3.5% |
| Other values (96) | 2152295 |
Common
| Value | Count | Frequency (%) |
| 242738 | ||
| - | 25671 | 8.6% |
| . | 13890 | 4.7% |
| ( | 4701 | 1.6% |
| ) | 4701 | 1.6% |
| / | 2984 | 1.0% |
| , | 834 | 0.3% |
| ' | 765 | 0.3% |
| ] | 212 | 0.1% |
| [ | 212 | 0.1% |
| Other values (3) | 189 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6837424 | |
| None | 84856 | 1.2% |
| Modifier Letters | 13 | < 0.1% |
| Latin Ext Additional | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1075650 | |
| i | 567474 | 8.3% |
| n | 508752 | 7.4% |
| o | 506284 | 7.4% |
| r | 439940 | 6.4% |
| e | 348295 | 5.1% |
| s | 278225 | 4.1% |
| l | 274188 | 4.0% |
| t | 243136 | 3.6% |
| 242738 | 3.6% | |
| Other values (54) | 2352742 |
None
| Value | Count | Frequency (%) |
| á | 30893 | |
| í | 17685 | |
| é | 13624 | |
| ó | 9696 | 11.4% |
| ã | 4720 | 5.6% |
| ô | 2764 | 3.3% |
| ñ | 1309 | 1.5% |
| ü | 921 | 1.1% |
| ä | 569 | 0.7% |
| ö | 452 | 0.5% |
| Other values (42) | 2223 | 2.6% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʼ | 13 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ṭ | 4 | |
| ồ | 1 | 20.0% |
county
Text
Missing 
| Distinct | 7486 |
|---|---|
| Distinct (%) | 4.6% |
| Missing | 826754 |
| Missing (%) | 83.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 49 |
|---|---|
| Median length | 44 |
| Mean length | 9.169770118 |
| Min length | 1 |
Unique
| Unique | 2670 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | Parroquia |
|---|---|
| 2nd row | Duval |
| 3rd row | Boulder |
| 4th row | Cantal |
| 5th row | Arlington |
| Value | Count | Frequency (%) |
| county | 12307 | 5.4% |
| san | 7180 | 3.2% |
| prince | 4211 | 1.8% |
| honolulu | 4162 | 1.8% |
| santa | 3941 | 1.7% |
| los | 3095 | 1.4% |
| angeles | 3053 | 1.3% |
| montgomery | 3051 | 1.3% |
| george's | 2971 | 1.3% |
| maui | 2856 | 1.3% |
| Other values (6200) | 181096 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 169425 | 11.4% |
| o | 120482 | 8.1% |
| n | 117091 | 7.9% |
| e | 113146 | 7.6% |
| r | 92576 | 6.2% |
| i | 86084 | 5.8% |
| t | 67805 | 4.6% |
| u | 67402 | 4.5% |
| 66275 | 4.5% | |
| l | 63833 | 4.3% |
| Other values (93) | 518156 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1186552 | |
| Uppercase Letter | 222119 | 15.0% |
| Space Separator | 66275 | 4.5% |
| Other Punctuation | 5600 | 0.4% |
| Dash Punctuation | 1348 | 0.1% |
| Open Punctuation | 169 | < 0.1% |
| Close Punctuation | 169 | < 0.1% |
| Modifier Symbol | 25 | < 0.1% |
| Decimal Number | 17 | < 0.1% |
| Nonspacing Mark | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 169425 | |
| o | 120482 | |
| n | 117091 | |
| e | 113146 | |
| r | 92576 | 7.8% |
| i | 86084 | 7.3% |
| t | 67805 | 5.7% |
| u | 67402 | 5.7% |
| l | 63833 | 5.4% |
| s | 53168 | 4.5% |
| Other values (36) | 235540 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 32957 | |
| S | 24240 | |
| M | 23123 | 10.4% |
| B | 15159 | 6.8% |
| P | 14024 | 6.3% |
| A | 13349 | 6.0% |
| H | 11880 | 5.3% |
| L | 11134 | 5.0% |
| G | 8902 | 4.0% |
| F | 7551 | 3.4% |
| Other values (26) | 59800 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 3366 | |
| . | 1205 | 21.5% |
| / | 820 | 14.6% |
| ? | 134 | 2.4% |
| , | 66 | 1.2% |
| & | 4 | 0.1% |
| ; | 2 | < 0.1% |
| ¡ | 2 | < 0.1% |
| \ | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 8 | |
| 2 | 7 | |
| 9 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 156 | |
| [ | 13 | 7.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 156 | |
| ] | 13 | 7.7% |
Space Separator
| Value | Count | Frequency (%) |
| 66275 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1348 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 25 |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1408671 | |
| Common | 73603 | 5.0% |
| Inherited | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 169425 | 12.0% |
| o | 120482 | 8.6% |
| n | 117091 | 8.3% |
| e | 113146 | 8.0% |
| r | 92576 | 6.6% |
| i | 86084 | 6.1% |
| t | 67805 | 4.8% |
| u | 67402 | 4.8% |
| l | 63833 | 4.5% |
| s | 53168 | 3.8% |
| Other values (72) | 457659 |
Common
| Value | Count | Frequency (%) |
| 66275 | ||
| ' | 3366 | 4.6% |
| - | 1348 | 1.8% |
| . | 1205 | 1.6% |
| / | 820 | 1.1% |
| ( | 156 | 0.2% |
| ) | 156 | 0.2% |
| ? | 134 | 0.2% |
| , | 66 | 0.1% |
| ´ | 25 | < 0.1% |
| Other values (10) | 52 | 0.1% |
Inherited
| Value | Count | Frequency (%) |
| ́ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1468756 | |
| None | 13518 | 0.9% |
| Diacriticals | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 169425 | 11.5% |
| o | 120482 | 8.2% |
| n | 117091 | 8.0% |
| e | 113146 | 7.7% |
| r | 92576 | 6.3% |
| i | 86084 | 5.9% |
| t | 67805 | 4.6% |
| u | 67402 | 4.6% |
| 66275 | 4.5% | |
| l | 63833 | 4.3% |
| Other values (60) | 504637 |
None
| Value | Count | Frequency (%) |
| á | 2802 | |
| é | 2321 | |
| í | 2210 | |
| ó | 1905 | |
| ã | 1610 | |
| ç | 989 | 7.3% |
| è | 288 | 2.1% |
| ô | 265 | 2.0% |
| ñ | 240 | 1.8% |
| ê | 235 | 1.7% |
| Other values (22) | 653 | 4.8% |
Diacriticals
| Value | Count | Frequency (%) |
| ́ | 1 |
locality
Text
Missing 
| Distinct | 617492 |
|---|---|
| Distinct (%) | 67.4% |
| Missing | 72708 |
| Missing (%) | 7.4% |
| Memory size | 7.5 MiB |
Length
| Max length | 373493 |
|---|---|
| Median length | 322 |
| Mean length | 47.98270492 |
| Min length | 1 |
Unique
| Unique | 529251 ? |
|---|---|
| Unique (%) | 57.8% |
Sample
| 1st row | Gulf of Mexico |
|---|---|
| 2nd row | Dept. Piura: Ayabaca |
| 3rd row | Dep. Pilcomayo. al E a 2 Km de P. Porteño. |
| 4th row | Selva siempre verde en las quebradas al norte de Los Tanques, arriba de la Planta Eléctrica, en las cabeceras del Río San Gián, al sur de Borburata. |
| 5th row | Flat terrain near Skukuza rest camp, Kruger National Park. |
| Value | Count | Frequency (%) |
| of | 347842 | 5.0% |
| de | 133785 | 1.9% |
| the | 82580 | 1.2% |
| km | 81320 | 1.2% |
| near | 74890 | 1.1% |
| on | 60171 | 0.9% |
| and | 59914 | 0.9% |
| in | 57394 | 0.8% |
| county | 55824 | 0.8% |
| la | 50630 | 0.7% |
| Other values (260445) | 5924543 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5977545 | 13.6% | |
| a | 3997394 | 9.1% |
| e | 3081991 | 7.0% |
| o | 2904902 | 6.6% |
| n | 2422306 | 5.5% |
| i | 2249317 | 5.1% |
| r | 2223035 | 5.1% |
| t | 1935539 | 4.4% |
| l | 1565888 | 3.6% |
| s | 1518943 | 3.5% |
| Other values (299) | 16060615 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 30180500 | |
| Space Separator | 5977545 | 13.6% |
| Uppercase Letter | 4570901 | 10.4% |
| Other Punctuation | 2198902 | 5.0% |
| Decimal Number | 593299 | 1.4% |
| Dash Punctuation | 123186 | 0.3% |
| Control | 116404 | 0.3% |
| Open Punctuation | 80204 | 0.2% |
| Close Punctuation | 79527 | 0.2% |
| Connector Punctuation | 6065 | < 0.1% |
| Other values (11) | 10942 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 3997394 | |
| e | 3081991 | |
| o | 2904902 | |
| n | 2422306 | 8.0% |
| i | 2249317 | 7.5% |
| r | 2223035 | 7.4% |
| t | 1935539 | 6.4% |
| l | 1565888 | 5.2% |
| s | 1518943 | 5.0% |
| u | 1126719 | 3.7% |
| Other values (121) | 7154466 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 535962 | 11.7% |
| S | 451705 | 9.9% |
| M | 342129 | 7.5% |
| P | 333055 | 7.3% |
| R | 291032 | 6.4% |
| B | 257123 | 5.6% |
| A | 233082 | 5.1% |
| N | 230710 | 5.0% |
| L | 203826 | 4.5% |
| T | 191753 | 4.2% |
| Other values (64) | 1500524 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1227571 | |
| , | 747186 | |
| : | 104272 | 4.7% |
| ; | 41055 | 1.9% |
| ' | 32466 | 1.5% |
| " | 21999 | 1.0% |
| / | 11348 | 0.5% |
| & | 8984 | 0.4% |
| # | 1963 | 0.1% |
| ? | 1416 | 0.1% |
| Other values (10) | 642 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 114514 | |
| 2 | 87351 | |
| 0 | 72576 | |
| 5 | 66155 | |
| 3 | 61822 | |
| 4 | 50921 | |
| 6 | 40384 | 6.8% |
| 8 | 34709 | 5.9% |
| 7 | 34671 | 5.8% |
| 9 | 30196 | 5.1% |
Math Symbol
| Value | Count | Frequency (%) |
| = | 1342 | |
| ± | 1154 | |
| + | 684 | |
| > | 257 | 6.6% |
| < | 247 | 6.3% |
| ~ | 219 | 5.6% |
| | | 7 | 0.2% |
| ∆ | 3 | 0.1% |
| → | 3 | 0.1% |
| × | 2 | 0.1% |
Control
| Value | Count | Frequency (%) |
| 115843 | ||
| 522 | 0.4% | |
| | 13 | < 0.1% |
| | 11 | < 0.1% |
| | 7 | < 0.1% |
| | 4 | < 0.1% |
| | 3 | < 0.1% |
| | 1 | < 0.1% |
Other Number
| Value | Count | Frequency (%) |
| ½ | 2797 | |
| ¼ | 1213 | |
| ¾ | 156 | 3.7% |
| ² | 14 | 0.3% |
| ⅓ | 11 | 0.3% |
| ⅛ | 3 | 0.1% |
| ⅜ | 2 | < 0.1% |
Format
| Value | Count | Frequency (%) |
| | 2 | |
| | 2 | |
| | 2 | |
| | 1 | |
| | 1 | |
| | 1 | |
| | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 59332 | |
| [ | 20706 | 25.8% |
| „ | 100 | 0.1% |
| ‚ | 44 | 0.1% |
| { | 22 | < 0.1% |
Other Symbol
| Value | Count | Frequency (%) |
| ° | 1435 | |
| ├ | 2 | 0.1% |
| ┬ | 1 | 0.1% |
| ░ | 1 | 0.1% |
| ¦ | 1 | 0.1% |
Nonspacing Mark
| Value | Count | Frequency (%) |
| ͤ | 2 | |
| ̈ | 2 | |
| ̋ | 1 | |
| ́ | 1 | |
| ᷉ | 1 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ´ | 101 | |
| ¨ | 2 | 1.9% |
| ^ | 2 | 1.9% |
| ˶ | 1 | 0.9% |
Currency Symbol
| Value | Count | Frequency (%) |
| ¢ | 25 | |
| ¤ | 12 | |
| $ | 4 | 9.5% |
| £ | 1 | 2.4% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 123170 | |
| – | 10 | < 0.1% |
| — | 6 | < 0.1% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 58762 | |
| ] | 20740 | 26.1% |
| } | 25 | < 0.1% |
Initial Punctuation
| Value | Count | Frequency (%) |
| « | 175 | |
| “ | 35 | 16.6% |
| ‛ | 1 | 0.5% |
Final Punctuation
| Value | Count | Frequency (%) |
| » | 172 | |
| ” | 9 | 4.8% |
| › | 6 | 3.2% |
Modifier Letter
| Value | Count | Frequency (%) |
| ʻ | 91 | |
| ᶵ | 1 | 1.1% |
| ᵍ | 1 | 1.1% |
Other Letter
| Value | Count | Frequency (%) |
| º | 708 | |
| ª | 24 | 3.3% |
Space Separator
| Value | Count | Frequency (%) |
| 5977545 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 6065 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34752091 | |
| Common | 9185332 | 20.9% |
| Greek | 43 | < 0.1% |
| Inherited | 8 | < 0.1% |
| Cyrillic | 1 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 3997394 | 11.5% |
| e | 3081991 | 8.9% |
| o | 2904902 | 8.4% |
| n | 2422306 | 7.0% |
| i | 2249317 | 6.5% |
| r | 2223035 | 6.4% |
| t | 1935539 | 5.6% |
| l | 1565888 | 4.5% |
| s | 1518943 | 4.4% |
| u | 1126719 | 3.2% |
| Other values (191) | 11726057 |
Common
| Value | Count | Frequency (%) |
| 5977545 | ||
| . | 1227571 | 13.4% |
| , | 747186 | 8.1% |
| - | 123170 | 1.3% |
| 115843 | 1.3% | |
| 1 | 114514 | 1.2% |
| : | 104272 | 1.1% |
| 2 | 87351 | 1.0% |
| 0 | 72576 | 0.8% |
| 5 | 66155 | 0.7% |
| Other values (84) | 549149 | 6.0% |
Greek
| Value | Count | Frequency (%) |
| λ | 12 | |
| Κ | 6 | |
| ν | 6 | |
| η | 6 | |
| υ | 6 | |
| ή | 6 | |
| Δ | 1 | 2.3% |
Inherited
| Value | Count | Frequency (%) |
| ͤ | 2 | |
| ̈ | 2 | |
| | 1 | |
| ̋ | 1 | |
| ́ | 1 | |
| ᷉ | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| ҫ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 43737376 | |
| None | 199715 | 0.5% |
| Punctuation | 244 | < 0.1% |
| Modifier Letters | 92 | < 0.1% |
| Number Forms | 16 | < 0.1% |
| Latin Ext Additional | 10 | < 0.1% |
| Diacriticals | 6 | < 0.1% |
| Math Operators | 3 | < 0.1% |
| Arrows | 3 | < 0.1% |
| Box Drawing | 3 | < 0.1% |
| Other values (6) | 7 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5977545 | 13.7% | |
| a | 3997394 | 9.1% |
| e | 3081991 | 7.0% |
| o | 2904902 | 6.6% |
| n | 2422306 | 5.5% |
| i | 2249317 | 5.1% |
| r | 2223035 | 5.1% |
| t | 1935539 | 4.4% |
| l | 1565888 | 3.6% |
| s | 1518943 | 3.5% |
| Other values (87) | 15860516 |
None
| Value | Count | Frequency (%) |
| í | 46285 | |
| á | 36812 | |
| é | 24344 | |
| ó | 20532 | |
| ñ | 10164 | 5.1% |
| ã | 7957 | 4.0% |
| ú | 5901 | 3.0% |
| ç | 5120 | 2.6% |
| ü | 4177 | 2.1% |
| ä | 3757 | 1.9% |
| Other values (161) | 34666 |
Punctuation
| Value | Count | Frequency (%) |
| „ | 100 | |
| ‚ | 44 | |
| “ | 35 | 14.3% |
| … | 26 | 10.7% |
| – | 10 | 4.1% |
| ” | 9 | 3.7% |
| › | 6 | 2.5% |
| — | 6 | 2.5% |
| | 2 | 0.8% |
| | 1 | 0.4% |
| Other values (5) | 5 | 2.0% |
Modifier Letters
| Value | Count | Frequency (%) |
| ʻ | 91 | |
| ˶ | 1 | 1.1% |
Number Forms
| Value | Count | Frequency (%) |
| ⅓ | 11 | |
| ⅛ | 3 | 18.8% |
| ⅜ | 2 | 12.5% |
Math Operators
| Value | Count | Frequency (%) |
| ∆ | 3 |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ị | 3 | |
| ẽ | 2 | |
| ṅ | 1 | 10.0% |
| ắ | 1 | 10.0% |
| ả | 1 | 10.0% |
| ạ | 1 | 10.0% |
| ế | 1 | 10.0% |
Arrows
| Value | Count | Frequency (%) |
| → | 3 |
Diacriticals
| Value | Count | Frequency (%) |
| ͤ | 2 | |
| ̈ | 2 | |
| ̋ | 1 | |
| ́ | 1 |
Box Drawing
| Value | Count | Frequency (%) |
| ├ | 2 | |
| ┬ | 1 |
IPA Ext
| Value | Count | Frequency (%) |
| ɶ | 2 |
Phonetic Ext Sup
| Value | Count | Frequency (%) |
| ᶵ | 1 |
Block Elements
| Value | Count | Frequency (%) |
| ░ | 1 |
Diacriticals Sup
| Value | Count | Frequency (%) |
| ᷉ | 1 |
Phonetic Ext
| Value | Count | Frequency (%) |
| ᵍ | 1 |
Cyrillic
| Value | Count | Frequency (%) |
| ҫ | 1 |
verbatimDepth
Text
Missing 
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 983702 |
| Missing (%) | 99.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 29 |
|---|---|
| Median length | 3 |
| Mean length | 3.033617021 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | ca. |
|---|---|
| 2nd row | ca. |
| 3rd row | ca. |
| 4th row | ca. |
| 5th row | ca. |
| Value | Count | Frequency (%) |
| ca | 4691 | |
| intertidal | 11 | 0.2% |
| mlw | 6 | 0.1% |
| above | 4 | 0.1% |
| below | 2 | < 0.1% |
| infralittoral | 1 | < 0.1% |
| 4-8 | 1 | < 0.1% |
| feet | 1 | < 0.1% |
| mean | 1 | < 0.1% |
| low | 1 | < 0.1% |
| Other values (5) | 5 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4710 | |
| c | 4691 | |
| . | 4653 | |
| t | 27 | 0.2% |
| 24 | 0.2% | |
| l | 23 | 0.2% |
| e | 21 | 0.1% |
| r | 14 | 0.1% |
| n | 13 | 0.1% |
| i | 13 | 0.1% |
| Other values (15) | 69 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9565 | |
| Other Punctuation | 4653 | |
| Space Separator | 24 | 0.2% |
| Uppercase Letter | 11 | 0.1% |
| Decimal Number | 3 | < 0.1% |
| Dash Punctuation | 1 | < 0.1% |
| Math Symbol | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4710 | |
| c | 4691 | |
| t | 27 | 0.3% |
| l | 23 | 0.2% |
| e | 21 | 0.2% |
| r | 14 | 0.1% |
| n | 13 | 0.1% |
| i | 13 | 0.1% |
| d | 11 | 0.1% |
| w | 10 | 0.1% |
| Other values (7) | 32 | 0.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 1 | |
| 8 | 1 | |
| 1 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4653 |
Space Separator
| Value | Count | Frequency (%) |
| 24 |
Uppercase Letter
| Value | Count | Frequency (%) |
| I | 11 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Math Symbol
| Value | Count | Frequency (%) |
| < | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9576 | |
| Common | 4682 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4710 | |
| c | 4691 | |
| t | 27 | 0.3% |
| l | 23 | 0.2% |
| e | 21 | 0.2% |
| r | 14 | 0.1% |
| n | 13 | 0.1% |
| i | 13 | 0.1% |
| I | 11 | 0.1% |
| d | 11 | 0.1% |
| Other values (8) | 42 | 0.4% |
Common
| Value | Count | Frequency (%) |
| . | 4653 | |
| 24 | 0.5% | |
| 4 | 1 | < 0.1% |
| - | 1 | < 0.1% |
| 8 | 1 | < 0.1% |
| < | 1 | < 0.1% |
| 1 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14258 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4710 | |
| c | 4691 | |
| . | 4653 | |
| t | 27 | 0.2% |
| 24 | 0.2% | |
| l | 23 | 0.2% |
| e | 21 | 0.1% |
| r | 14 | 0.1% |
| n | 13 | 0.1% |
| i | 13 | 0.1% |
| Other values (15) | 69 | 0.5% |
decimalLatitude
Text
Missing 
| Distinct | 30964 |
|---|---|
| Distinct (%) | 21.0% |
| Missing | 841005 |
| Missing (%) | 85.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 5.800233383 |
| Min length | 3 |
Unique
| Unique | 16592 ? |
|---|---|
| Unique (%) | 11.3% |
Sample
| 1st row | 26.2786 |
|---|---|
| 2nd row | -35.57 |
| 3rd row | 18.6519 |
| 4th row | -36.68 |
| 5th row | 5.86667 |
| Value | Count | Frequency (%) |
| 38.9694 | 858 | 0.6% |
| 38.895 | 856 | 0.6% |
| 9.405 | 393 | 0.3% |
| 0.83 | 372 | 0.3% |
| 0.35 | 371 | 0.3% |
| 3.61 | 370 | 0.3% |
| 5.16667 | 340 | 0.2% |
| 5.2 | 335 | 0.2% |
| 38.8664 | 324 | 0.2% |
| 12.83 | 312 | 0.2% |
| Other values (28342) | 142866 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 147397 | |
| 3 | 100620 | |
| 1 | 79516 | |
| 8 | 73185 | |
| 2 | 72959 | |
| 5 | 68749 | |
| 6 | 65516 | |
| 7 | 59644 | |
| 4 | 52733 | 6.2% |
| 9 | 50290 | 5.9% |
| Other values (2) | 84328 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 665840 | |
| Other Punctuation | 147397 | 17.2% |
| Dash Punctuation | 41700 | 4.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 100620 | |
| 1 | 79516 | |
| 8 | 73185 | |
| 2 | 72959 | |
| 5 | 68749 | |
| 6 | 65516 | |
| 7 | 59644 | |
| 4 | 52733 | |
| 9 | 50290 | |
| 0 | 42628 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 147397 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 41700 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 854937 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 147397 | |
| 3 | 100620 | |
| 1 | 79516 | |
| 8 | 73185 | |
| 2 | 72959 | |
| 5 | 68749 | |
| 6 | 65516 | |
| 7 | 59644 | |
| 4 | 52733 | 6.2% |
| 9 | 50290 | 5.9% |
| Other values (2) | 84328 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 854937 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 147397 | |
| 3 | 100620 | |
| 1 | 79516 | |
| 8 | 73185 | |
| 2 | 72959 | |
| 5 | 68749 | |
| 6 | 65516 | |
| 7 | 59644 | |
| 4 | 52733 | 6.2% |
| 9 | 50290 | 5.9% |
| Other values (2) | 84328 |
decimalLongitude
Text
Missing 
| Distinct | 32805 |
|---|---|
| Distinct (%) | 22.3% |
| Missing | 841005 |
| Missing (%) | 85.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 10 |
| Mean length | 6.791515431 |
| Min length | 3 |
Unique
| Unique | 17344 ? |
|---|---|
| Unique (%) | 11.8% |
Sample
| 1st row | -83.7803 |
|---|---|
| 2nd row | 137.32 |
| 3rd row | -71.5572 |
| 4th row | -72.97 |
| 5th row | -60.5667 |
| Value | Count | Frequency (%) |
| 77.1767 | 842 | 0.6% |
| 77.0367 | 831 | 0.6% |
| 59.4833 | 487 | 0.3% |
| 53.2 | 382 | 0.3% |
| 79.8635 | 382 | 0.3% |
| 52.33 | 360 | 0.2% |
| 59.48 | 325 | 0.2% |
| 79.73 | 307 | 0.2% |
| 88.08 | 302 | 0.2% |
| 70.95 | 301 | 0.2% |
| Other values (31262) | 142878 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 147397 | |
| - | 125223 | |
| 7 | 109174 | |
| 1 | 87815 | |
| 6 | 84392 | |
| 5 | 83702 | |
| 3 | 71057 | |
| 8 | 70903 | |
| 9 | 59906 | |
| 2 | 57756 | 5.8% |
| Other values (2) | 103724 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 728429 | |
| Other Punctuation | 147397 | 14.7% |
| Dash Punctuation | 125223 | 12.5% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 109174 | |
| 1 | 87815 | |
| 6 | 84392 | |
| 5 | 83702 | |
| 3 | 71057 | |
| 8 | 70903 | |
| 9 | 59906 | |
| 2 | 57756 | |
| 4 | 51961 | |
| 0 | 51763 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 147397 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 125223 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1001049 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 147397 | |
| - | 125223 | |
| 7 | 109174 | |
| 1 | 87815 | |
| 6 | 84392 | |
| 5 | 83702 | |
| 3 | 71057 | |
| 8 | 70903 | |
| 9 | 59906 | |
| 2 | 57756 | 5.8% |
| Other values (2) | 103724 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1001049 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 147397 | |
| - | 125223 | |
| 7 | 109174 | |
| 1 | 87815 | |
| 6 | 84392 | |
| 5 | 83702 | |
| 3 | 71057 | |
| 8 | 70903 | |
| 9 | 59906 | |
| 2 | 57756 | 5.8% |
| Other values (2) | 103724 |
coordinateUncertaintyInMeters
Text
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 987002 |
| Missing (%) | 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 6 |
| Mean length | 5.855 |
| Min length | 4 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | 16000.0 |
|---|---|
| 2nd row | 1500.0 |
| 3rd row | 250.0 |
| 4th row | 500.0 |
| 5th row | 1500.0 |
| Value | Count | Frequency (%) |
| 16000.0 | 286 | |
| 1000.0 | 277 | |
| 500.0 | 234 | |
| 250.0 | 145 | |
| 3000.0 | 135 | |
| 5000.0 | 68 | 4.9% |
| 750.0 | 67 | 4.8% |
| 1500.0 | 51 | 3.6% |
| 2000.0 | 38 | 2.7% |
| 3500.0 | 32 | 2.3% |
| Other values (10) | 67 | 4.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 4801 | |
| . | 1400 | 17.1% |
| 1 | 647 | 7.9% |
| 5 | 613 | 7.5% |
| 6 | 291 | 3.6% |
| 2 | 196 | 2.4% |
| 3 | 171 | 2.1% |
| 7 | 67 | 0.8% |
| 8 | 9 | 0.1% |
| 4 | 2 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6797 | |
| Other Punctuation | 1400 | 17.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4801 | |
| 1 | 647 | 9.5% |
| 5 | 613 | 9.0% |
| 6 | 291 | 4.3% |
| 2 | 196 | 2.9% |
| 3 | 171 | 2.5% |
| 7 | 67 | 1.0% |
| 8 | 9 | 0.1% |
| 4 | 2 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1400 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8197 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 4801 | |
| . | 1400 | 17.1% |
| 1 | 647 | 7.9% |
| 5 | 613 | 7.5% |
| 6 | 291 | 3.6% |
| 2 | 196 | 2.4% |
| 3 | 171 | 2.1% |
| 7 | 67 | 0.8% |
| 8 | 9 | 0.1% |
| 4 | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8197 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 4801 | |
| . | 1400 | 17.1% |
| 1 | 647 | 7.9% |
| 5 | 613 | 7.5% |
| 6 | 291 | 3.6% |
| 2 | 196 | 2.4% |
| 3 | 171 | 2.1% |
| 7 | 67 | 0.8% |
| 8 | 9 | 0.1% |
| 4 | 2 | < 0.1% |
Missing 
| Distinct | 4 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 980404 |
| Missing (%) | 99.2% |
| Memory size | 7.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 23 |
| Mean length | 22.98012003 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Degrees Minutes Seconds |
|---|---|
| 2nd row | Degrees Minutes Seconds |
| 3rd row | Degrees Minutes Seconds |
| 4th row | Degrees Minutes Seconds |
| 5th row | Degrees Minutes Seconds |
| Value | Count | Frequency (%) |
| degrees | 7992 | |
| minutes | 7986 | |
| seconds | 7986 | |
| decimal | 6 | < 0.1% |
| quad | 5 | < 0.1% |
| unknown | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 39954 | |
| s | 23964 | |
| 15978 | 8.7% | |
| n | 15975 | 8.7% |
| D | 7997 | 4.4% |
| g | 7992 | 4.3% |
| r | 7992 | 4.3% |
| d | 7992 | 4.3% |
| i | 7992 | 4.3% |
| c | 7992 | 4.3% |
| Other values (13) | 39967 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 143832 | |
| Uppercase Letter | 23985 | 13.0% |
| Space Separator | 15978 | 8.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 39954 | |
| s | 23964 | |
| n | 15975 | 11.1% |
| g | 7992 | 5.6% |
| r | 7992 | 5.6% |
| d | 7992 | 5.6% |
| i | 7992 | 5.6% |
| c | 7992 | 5.6% |
| o | 7987 | 5.6% |
| t | 7986 | 5.6% |
| Other values (6) | 8006 | 5.6% |
Uppercase Letter
| Value | Count | Frequency (%) |
| D | 7997 | |
| S | 7986 | |
| M | 7986 | |
| U | 6 | < 0.1% |
| Q | 5 | < 0.1% |
| A | 5 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 15978 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 167817 | |
| Common | 15978 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 39954 | |
| s | 23964 | |
| n | 15975 | 9.5% |
| D | 7997 | 4.8% |
| g | 7992 | 4.8% |
| r | 7992 | 4.8% |
| d | 7992 | 4.8% |
| i | 7992 | 4.8% |
| c | 7992 | 4.8% |
| o | 7987 | 4.8% |
| Other values (12) | 31980 |
Common
| Value | Count | Frequency (%) |
| 15978 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 183795 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 39954 | |
| s | 23964 | |
| 15978 | 8.7% | |
| n | 15975 | 8.7% |
| D | 7997 | 4.4% |
| g | 7992 | 4.3% |
| r | 7992 | 4.3% |
| d | 7992 | 4.3% |
| i | 7992 | 4.3% |
| c | 7992 | 4.3% |
| Other values (13) | 39967 |
verbatimSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1938-11-11 |
|---|
| Value | Count | Frequency (%) |
| 1938-11-11 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 5 | |
| - | 2 | 20.0% |
| 9 | 1 | 10.0% |
| 3 | 1 | 10.0% |
| 8 | 1 | 10.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 8 | |
| Dash Punctuation | 2 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 5 | |
| 9 | 1 | 12.5% |
| 3 | 1 | 12.5% |
| 8 | 1 | 12.5% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 5 | |
| - | 2 | 20.0% |
| 9 | 1 | 10.0% |
| 3 | 1 | 10.0% |
| 8 | 1 | 10.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 5 | |
| - | 2 | 20.0% |
| 9 | 1 | 10.0% |
| 3 | 1 | 10.0% |
| 8 | 1 | 10.0% |
footprintSRS
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 315 |
|---|
| Value | Count | Frequency (%) |
| 315 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 315 |
|---|
| Value | Count | Frequency (%) |
| 315 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 1 | 1 | |
| 5 | 1 |
georeferencedBy
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1938 |
|---|
| Value | Count | Frequency (%) |
| 1938 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 3 | 1 | |
| 8 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 3 | 1 | |
| 8 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 3 | 1 | |
| 8 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 1 | |
| 9 | 1 | |
| 3 | 1 | |
| 8 | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 11 |
|---|
| Value | Count | Frequency (%) |
| 11 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 2 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 |
Missing 
| Distinct | 20 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 960543 |
| Missing (%) | 97.2% |
| Memory size | 7.5 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 16 |
| Mean length | 8.355289135 |
| Min length | 2 |
Unique
| Unique | 4 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Gazetteer |
|---|---|
| 2nd row | Gazetteer |
| 3rd row | Gazetteer |
| 4th row | Gazetteer |
| 5th row | Label |
| Value | Count | Frequency (%) |
| gazetteer | 10962 | |
| gps | 5054 | |
| gis | 4557 | |
| arcview | 4557 | |
| label | 3720 | 10.3% |
| 3348 | 9.3% | |
| maps | 2711 | 7.5% |
| earth | 637 | 1.8% |
| source | 400 | 1.1% |
| g-1 | 76 | 0.2% |
| Other values (11) | 162 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 44988 | |
| G | 23987 | 10.3% |
| t | 22571 | 9.7% |
| a | 18101 | 7.8% |
| r | 16566 | 7.1% |
| z | 10962 | 4.7% |
| S | 9997 | 4.3% |
| 8325 | 3.6% | |
| o | 7139 | 3.1% |
| l | 7094 | 3.0% |
| Other values (30) | 63040 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 160449 | |
| Uppercase Letter | 54726 | 23.5% |
| Space Separator | 8325 | 3.6% |
| Close Punctuation | 4557 | 2.0% |
| Open Punctuation | 4557 | 2.0% |
| Decimal Number | 78 | < 0.1% |
| Dash Punctuation | 76 | < 0.1% |
| Other Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 44988 | |
| t | 22571 | |
| a | 18101 | |
| r | 16566 | 10.3% |
| z | 10962 | 6.8% |
| o | 7139 | 4.4% |
| l | 7094 | 4.4% |
| c | 4961 | 3.1% |
| i | 4578 | 2.9% |
| w | 4559 | 2.8% |
| Other values (12) | 18930 |
Uppercase Letter
| Value | Count | Frequency (%) |
| G | 23987 | |
| S | 9997 | |
| P | 5040 | 9.2% |
| I | 4557 | 8.3% |
| A | 4557 | 8.3% |
| L | 3715 | 6.8% |
| M | 2172 | 4.0% |
| E | 637 | 1.2% |
| W | 53 | 0.1% |
| C | 9 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 8325 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 4557 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 4557 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 78 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 76 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 215175 | |
| Common | 17595 | 7.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 44988 | |
| G | 23987 | |
| t | 22571 | |
| a | 18101 | 8.4% |
| r | 16566 | 7.7% |
| z | 10962 | 5.1% |
| S | 9997 | 4.6% |
| o | 7139 | 3.3% |
| l | 7094 | 3.3% |
| P | 5040 | 2.3% |
| Other values (24) | 48730 |
Common
| Value | Count | Frequency (%) |
| 8325 | ||
| ) | 4557 | |
| ( | 4557 | |
| 1 | 78 | 0.4% |
| - | 76 | 0.4% |
| . | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 232770 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 44988 | |
| G | 23987 | 10.3% |
| t | 22571 | 9.7% |
| a | 18101 | 7.8% |
| r | 16566 | 7.1% |
| z | 10962 | 4.7% |
| S | 9997 | 4.3% |
| 8325 | 3.6% | |
| o | 7139 | 3.1% |
| l | 7094 | 3.0% |
| Other values (30) | 63040 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 11 |
| Mean length | 11 |
| Min length | 11 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 11 Nov 1938 |
|---|
| Value | Count | Frequency (%) |
| 11 | 1 | |
| nov | 1 | |
| 1938 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | ||
| N | 1 | 9.1% |
| o | 1 | 9.1% |
| v | 1 | 9.1% |
| 9 | 1 | 9.1% |
| 3 | 1 | 9.1% |
| 8 | 1 | 9.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6 | |
| Space Separator | 2 | 18.2% |
| Lowercase Letter | 2 | 18.2% |
| Uppercase Letter | 1 | 9.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 9 | 1 | 16.7% |
| 3 | 1 | 16.7% |
| 8 | 1 | 16.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 1 | |
| v | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 2 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 8 | |
| Latin | 3 | 27.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | ||
| 9 | 1 | 12.5% |
| 3 | 1 | 12.5% |
| 8 | 1 | 12.5% |
Latin
| Value | Count | Frequency (%) |
| N | 1 | |
| o | 1 | |
| v | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 3 | |
| 2 | ||
| N | 1 | 9.1% |
| o | 1 | 9.1% |
| v | 1 | 9.1% |
| 9 | 1 | 9.1% |
| 3 | 1 | 9.1% |
| 8 | 1 | 9.1% |
Missing 
| Distinct | 38 |
|---|---|
| Distinct (%) | 33.6% |
| Missing | 988289 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 53 |
|---|---|
| Median length | 40 |
| Mean length | 18.7699115 |
| Min length | 3 |
Unique
| Unique | 21 ? |
|---|---|
| Unique (%) | 18.6% |
Sample
| 1st row | +-1000m |
|---|---|
| 2nd row | stop 1 - beginning of bike path, along GW pkwy |
| 3rd row | ca.; ca. |
| 4th row | stop 1-ditch; stop 2- polkweed; stop 3; stop 4 |
| 5th row | Long. 4 8 W - 4 15 W |
| Value | Count | Frequency (%) |
| stop | 48 | 10.5% |
| 4 | 29 | 6.3% |
| 26 | 5.7% | |
| ca | 23 | 5.0% |
| w | 22 | 4.8% |
| 1 | 21 | 4.6% |
| invalid | 13 | 2.8% |
| of | 13 | 2.8% |
| as | 13 | 2.8% |
| seconds | 13 | 2.8% |
| Other values (63) | 238 |
Most occurring characters
| Value | Count | Frequency (%) |
| 346 | ||
| o | 142 | 6.7% |
| n | 124 | 5.8% |
| a | 118 | 5.6% |
| t | 118 | 5.6% |
| e | 116 | 5.5% |
| i | 113 | 5.3% |
| s | 94 | 4.4% |
| p | 82 | 3.9% |
| l | 81 | 3.8% |
| Other values (44) | 787 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1365 | |
| Space Separator | 346 | 16.3% |
| Decimal Number | 140 | 6.6% |
| Uppercase Letter | 140 | 6.6% |
| Other Punctuation | 82 | 3.9% |
| Dash Punctuation | 42 | 2.0% |
| Math Symbol | 6 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 142 | |
| n | 124 | |
| a | 118 | 8.6% |
| t | 118 | 8.6% |
| e | 116 | 8.5% |
| i | 113 | 8.3% |
| s | 94 | 6.9% |
| p | 82 | 6.0% |
| l | 81 | 5.9% |
| d | 75 | 5.5% |
| Other values (13) | 302 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 35 | |
| S | 25 | |
| L | 15 | |
| G | 14 | 10.0% |
| T | 8 | 5.7% |
| M | 7 | 5.0% |
| U | 7 | 5.0% |
| C | 6 | 4.3% |
| V | 5 | 3.6% |
| F | 4 | 2.9% |
| Other values (5) | 14 | 10.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 44 | |
| 4 | 33 | |
| 0 | 18 | |
| 8 | 15 | 10.7% |
| 5 | 12 | 8.6% |
| 3 | 9 | 6.4% |
| 2 | 8 | 5.7% |
| 6 | 1 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 48 | |
| ; | 21 | |
| , | 9 | 11.0% |
| / | 3 | 3.7% |
| ' | 1 | 1.2% |
Space Separator
| Value | Count | Frequency (%) |
| 346 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 42 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1505 | |
| Common | 616 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 142 | 9.4% |
| n | 124 | 8.2% |
| a | 118 | 7.8% |
| t | 118 | 7.8% |
| e | 116 | 7.7% |
| i | 113 | 7.5% |
| s | 94 | 6.2% |
| p | 82 | 5.4% |
| l | 81 | 5.4% |
| d | 75 | 5.0% |
| Other values (28) | 442 |
Common
| Value | Count | Frequency (%) |
| 346 | ||
| . | 48 | 7.8% |
| 1 | 44 | 7.1% |
| - | 42 | 6.8% |
| 4 | 33 | 5.4% |
| ; | 21 | 3.4% |
| 0 | 18 | 2.9% |
| 8 | 15 | 2.4% |
| 5 | 12 | 1.9% |
| , | 9 | 1.5% |
| Other values (6) | 28 | 4.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2121 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 346 | ||
| o | 142 | 6.7% |
| n | 124 | 5.8% |
| a | 118 | 5.6% |
| t | 118 | 5.6% |
| e | 116 | 5.5% |
| i | 113 | 5.3% |
| s | 94 | 4.4% |
| p | 82 | 3.9% |
| l | 81 | 3.8% |
| Other values (44) | 787 |
latestEpochOrHighestSeries
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 42 |
|---|---|
| Median length | 42 |
| Mean length | 42 |
| Min length | 42 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | South America - Neotropics, Colombia, Meta |
|---|
| Value | Count | Frequency (%) |
| south | 1 | |
| america | 1 | |
| 1 | ||
| neotropics | 1 | |
| colombia | 1 | |
| meta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 5 | ||
| o | 5 | |
| t | 3 | 7.1% |
| e | 3 | 7.1% |
| i | 3 | 7.1% |
| a | 3 | 7.1% |
| c | 2 | 4.8% |
| m | 2 | 4.8% |
| r | 2 | 4.8% |
| , | 2 | 4.8% |
| Other values (12) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29 | |
| Space Separator | 5 | 11.9% |
| Uppercase Letter | 5 | 11.9% |
| Other Punctuation | 2 | 4.8% |
| Dash Punctuation | 1 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | |
| e | 3 | |
| i | 3 | |
| a | 3 | |
| c | 2 | 6.9% |
| m | 2 | 6.9% |
| r | 2 | 6.9% |
| p | 1 | 3.4% |
| b | 1 | 3.4% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| S | 1 | |
| N | 1 | |
| A | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 2 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 34 | |
| Common | 8 | 19.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 5 | |
| t | 3 | 8.8% |
| e | 3 | 8.8% |
| i | 3 | 8.8% |
| a | 3 | 8.8% |
| c | 2 | 5.9% |
| m | 2 | 5.9% |
| r | 2 | 5.9% |
| p | 1 | 2.9% |
| b | 1 | 2.9% |
| Other values (9) | 9 |
Common
| Value | Count | Frequency (%) |
| 5 | ||
| , | 2 | 25.0% |
| - | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 42 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 5 | ||
| o | 5 | |
| t | 3 | 7.1% |
| e | 3 | 7.1% |
| i | 3 | 7.1% |
| a | 3 | 7.1% |
| c | 2 | 4.8% |
| m | 2 | 4.8% |
| r | 2 | 4.8% |
| , | 2 | 4.8% |
| Other values (12) | 12 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | SOUTH_AMERICA |
|---|
| Value | Count | Frequency (%) |
| south_america | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2 | |
| S | 1 | |
| O | 1 | |
| U | 1 | |
| T | 1 | |
| H | 1 | |
| _ | 1 | |
| M | 1 | |
| E | 1 | |
| R | 1 | |
| Other values (2) | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 12 | |
| Connector Punctuation | 1 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| S | 1 | |
| O | 1 | |
| U | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| R | 1 | |
| I | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 | |
| Common | 1 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2 | |
| S | 1 | |
| O | 1 | |
| U | 1 | |
| T | 1 | |
| H | 1 | |
| M | 1 | |
| E | 1 | |
| R | 1 | |
| I | 1 |
Common
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 2 | |
| S | 1 | |
| O | 1 | |
| U | 1 | |
| T | 1 | |
| H | 1 | |
| _ | 1 | |
| M | 1 | |
| E | 1 | |
| R | 1 | |
| Other values (2) | 2 |
lowestBiostratigraphicZone
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 7296210 |
|---|
| Value | Count | Frequency (%) |
| 7296210 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | CO |
|---|
| Value | Count | Frequency (%) |
| co | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| C | 1 | |
| O | 1 |
group
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Meta |
|---|
| Value | Count | Frequency (%) |
| meta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| M | 1 | |
| e | 1 | |
| t | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 3 | |
| Uppercase Letter | 1 | 25.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | |
| t | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| M | 1 | |
| e | 1 | |
| t | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| M | 1 | |
| e | 1 | |
| t | 1 | |
| a | 1 |
bed
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988400 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 23.5 |
| Mean length | 23.5 |
| Min length | 15 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rinorea pubiflora var. pubiflora |
|---|---|
| 2nd row | Villa Vicencia. |
| Value | Count | Frequency (%) |
| pubiflora | 2 | |
| rinorea | 1 | |
| var | 1 | |
| villa | 1 | |
| vicencia | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 6 | |
| r | 4 | 8.5% |
| 4 | 8.5% | |
| l | 4 | 8.5% |
| o | 3 | 6.4% |
| p | 2 | 4.3% |
| f | 2 | 4.3% |
| V | 2 | 4.3% |
| . | 2 | 4.3% |
| Other values (7) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 38 | |
| Space Separator | 4 | 8.5% |
| Uppercase Letter | 3 | 6.4% |
| Other Punctuation | 2 | 4.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 6 | |
| r | 4 | |
| l | 4 | |
| o | 3 | |
| p | 2 | 5.3% |
| f | 2 | 5.3% |
| c | 2 | 5.3% |
| b | 2 | 5.3% |
| u | 2 | 5.3% |
| Other values (3) | 5 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 2 | |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 41 | |
| Common | 6 | 12.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 6 | |
| r | 4 | |
| l | 4 | |
| o | 3 | 7.3% |
| p | 2 | 4.9% |
| f | 2 | 4.9% |
| V | 2 | 4.9% |
| c | 2 | 4.9% |
| b | 2 | 4.9% |
| Other values (5) | 8 |
Common
| Value | Count | Frequency (%) |
| 4 | ||
| . | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 47 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6 | |
| i | 6 | |
| r | 4 | 8.5% |
| 4 | 8.5% | |
| l | 4 | 8.5% |
| o | 3 | 6.4% |
| p | 2 | 4.3% |
| f | 2 | 4.3% |
| V | 2 | 4.3% |
| . | 2 | 4.3% |
| Other values (7) | 12 |
Missing 
| Distinct | 16 |
|---|---|
| Distinct (%) | 0.7% |
| Missing | 985985 |
| Missing (%) | 99.8% |
| Memory size | 7.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 3 |
| Mean length | 4.395531651 |
| Min length | 2 |
Unique
| Unique | 3 ? |
|---|---|
| Unique (%) | 0.1% |
Sample
| 1st row | cf. |
|---|---|
| 2nd row | cf. |
| 3rd row | cf. |
| 4th row | vel aff. |
| 5th row | vel aff. |
| Value | Count | Frequency (%) |
| cf | 1295 | |
| aff | 610 | |
| uncertain | 368 | 14.6% |
| s.l | 125 | 5.0% |
| vel | 77 | 3.1% |
| sp | 15 | 0.6% |
| near | 13 | 0.5% |
| nov | 13 | 0.5% |
| s.s | 5 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 2515 | |
| . | 2176 | |
| c | 1663 | |
| a | 991 | 9.3% |
| n | 762 | 7.2% |
| e | 458 | 4.3% |
| r | 381 | 3.6% |
| i | 368 | 3.5% |
| t | 368 | 3.5% |
| u | 365 | 3.4% |
| Other values (7) | 577 | 5.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8341 | |
| Other Punctuation | 2176 | 20.5% |
| Space Separator | 104 | 1.0% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 2515 | |
| c | 1663 | |
| a | 991 | 11.9% |
| n | 762 | 9.1% |
| e | 458 | 5.5% |
| r | 381 | 4.6% |
| i | 368 | 4.4% |
| t | 368 | 4.4% |
| u | 365 | 4.4% |
| l | 202 | 2.4% |
| Other values (4) | 268 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 2176 |
Space Separator
| Value | Count | Frequency (%) |
| 104 |
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 3 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8344 | |
| Common | 2280 | 21.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 2515 | |
| c | 1663 | |
| a | 991 | 11.9% |
| n | 762 | 9.1% |
| e | 458 | 5.5% |
| r | 381 | 4.6% |
| i | 368 | 4.4% |
| t | 368 | 4.4% |
| u | 365 | 4.4% |
| l | 202 | 2.4% |
| Other values (5) | 271 | 3.2% |
Common
| Value | Count | Frequency (%) |
| . | 2176 | |
| 104 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10624 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 2515 | |
| . | 2176 | |
| c | 1663 | |
| a | 991 | 9.3% |
| n | 762 | 7.2% |
| e | 458 | 4.3% |
| r | 381 | 3.6% |
| i | 368 | 3.5% |
| t | 368 | 3.5% |
| u | 365 | 3.4% |
| Other values (7) | 577 | 5.4% |
typeStatus
Text
Missing 
| Distinct | 13 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 967033 |
| Missing (%) | 97.8% |
| Memory size | 7.5 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 7 |
| Mean length | 7.474893537 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | ISOTYPE |
|---|---|
| 2nd row | ISOTYPE |
| 3rd row | HOLOTYPE |
| 4th row | ISOTYPE |
| 5th row | ISOTYPE |
| Value | Count | Frequency (%) |
| isotype | 13211 | |
| holotype | 4263 | 19.9% |
| isosyntype | 1377 | 6.4% |
| syntype | 1202 | 5.6% |
| type | 444 | 2.1% |
| isolectotype | 434 | 2.0% |
| lectotype | 195 | 0.9% |
| isoneotype | 97 | 0.5% |
| paratype | 75 | 0.4% |
| neotype | 46 | 0.2% |
| Other values (3) | 25 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 24437 | |
| Y | 23932 | |
| E | 22146 | |
| T | 21998 | |
| P | 21437 | |
| S | 17702 | |
| I | 15176 | |
| L | 4924 | 3.1% |
| H | 4263 | 2.7% |
| N | 2738 | 1.7% |
| Other values (5) | 978 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 159731 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 24437 | |
| Y | 23932 | |
| E | 22146 | |
| T | 21998 | |
| P | 21437 | |
| S | 17702 | |
| I | 15176 | |
| L | 4924 | 3.1% |
| H | 4263 | 2.7% |
| N | 2738 | 1.7% |
| Other values (5) | 978 | 0.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 159731 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 24437 | |
| Y | 23932 | |
| E | 22146 | |
| T | 21998 | |
| P | 21437 | |
| S | 17702 | |
| I | 15176 | |
| L | 4924 | 3.1% |
| H | 4263 | 2.7% |
| N | 2738 | 1.7% |
| Other values (5) | 978 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 159731 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 24437 | |
| Y | 23932 | |
| E | 22146 | |
| T | 21998 | |
| P | 21437 | |
| S | 17702 | |
| I | 15176 | |
| L | 4924 | 3.1% |
| H | 4263 | 2.7% |
| N | 2738 | 1.7% |
| Other values (5) | 978 | 0.6% |
identifiedBy
Text
Missing 
| Distinct | 4879 |
|---|---|
| Distinct (%) | 4.0% |
| Missing | 866335 |
| Missing (%) | 87.7% |
| Memory size | 7.5 MiB |
Length
| Max length | 131 |
|---|---|
| Median length | 108 |
| Mean length | 37.6902275 |
| Min length | 3 |
Unique
| Unique | 1781 ? |
|---|---|
| Unique (%) | 1.5% |
Sample
| 1st row | Blair, S. M. |
|---|---|
| 2nd row | Acevedo-Rodríguez, P., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 3rd row | Acevedo-Rodríguez, P., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 4th row | Wagner, W. L., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| 5th row | Wagner, W. L., (BOT), Smithsonian Institution - National Museum of Natural History (UNITED STATES) |
| Value | Count | Frequency (%) |
| united | 29831 | 4.2% |
| states | 29821 | 4.2% |
| of | 27693 | 3.9% |
| 27094 | 3.8% | |
| national | 26421 | 3.7% |
| museum | 26206 | 3.6% |
| smithsonian | 26080 | 3.6% |
| natural | 26024 | 3.6% |
| history | 26005 | 3.6% |
| institution | 26002 | 3.6% |
| Other values (4341) | 447270 |
Most occurring characters
| Value | Count | Frequency (%) |
| 596380 | 13.0% | |
| t | 259398 | 5.6% |
| a | 250409 | 5.4% |
| o | 245191 | 5.3% |
| i | 229198 | 5.0% |
| n | 225319 | 4.9% |
| , | 198332 | 4.3% |
| . | 187693 | 4.1% |
| r | 186814 | 4.1% |
| e | 183391 | 4.0% |
| Other values (81) | 2038608 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2399509 | |
| Uppercase Letter | 1037834 | |
| Space Separator | 596380 | 13.0% |
| Other Punctuation | 392196 | 8.5% |
| Open Punctuation | 70430 | 1.5% |
| Close Punctuation | 70430 | 1.5% |
| Dash Punctuation | 33950 | 0.7% |
| Decimal Number | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 259398 | |
| a | 250409 | |
| o | 245191 | |
| i | 229198 | |
| n | 225319 | |
| r | 186814 | |
| e | 183391 | |
| u | 150111 | 6.3% |
| s | 150086 | 6.3% |
| l | 109220 | 4.6% |
| Other values (33) | 410372 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 134600 | |
| T | 126793 | |
| N | 98928 | 9.5% |
| E | 85767 | 8.3% |
| I | 62658 | 6.0% |
| A | 62027 | 6.0% |
| M | 58662 | 5.7% |
| D | 56356 | 5.4% |
| U | 48305 | 4.7% |
| H | 46164 | 4.4% |
| Other values (20) | 257574 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 198332 | |
| . | 187693 | |
| ; | 5701 | 1.5% |
| " | 272 | 0.1% |
| ' | 140 | < 0.1% |
| & | 42 | < 0.1% |
| ¡ | 12 | < 0.1% |
| ? | 3 | < 0.1% |
| / | 1 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 9 | 2 | |
| 1 | 1 | |
| 2 | 1 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 69931 | |
| [ | 499 | 0.7% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 69931 | |
| ] | 499 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 596380 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 33950 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3437343 | |
| Common | 1163390 | 25.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 259398 | 7.5% |
| a | 250409 | 7.3% |
| o | 245191 | 7.1% |
| i | 229198 | 6.7% |
| n | 225319 | 6.6% |
| r | 186814 | 5.4% |
| e | 183391 | 5.3% |
| u | 150111 | 4.4% |
| s | 150086 | 4.4% |
| S | 134600 | 3.9% |
| Other values (63) | 1422826 |
Common
| Value | Count | Frequency (%) |
| 596380 | ||
| , | 198332 | 17.0% |
| . | 187693 | 16.1% |
| ( | 69931 | 6.0% |
| ) | 69931 | 6.0% |
| - | 33950 | 2.9% |
| ; | 5701 | 0.5% |
| [ | 499 | < 0.1% |
| ] | 499 | < 0.1% |
| " | 272 | < 0.1% |
| Other values (8) | 202 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4593697 | |
| None | 7036 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 596380 | 13.0% | |
| t | 259398 | 5.6% |
| a | 250409 | 5.5% |
| o | 245191 | 5.3% |
| i | 229198 | 5.0% |
| n | 225319 | 4.9% |
| , | 198332 | 4.3% |
| . | 187693 | 4.1% |
| r | 186814 | 4.1% |
| e | 183391 | 4.0% |
| Other values (59) | 2031572 |
None
| Value | Count | Frequency (%) |
| í | 3953 | |
| á | 810 | 11.5% |
| é | 712 | 10.1% |
| ñ | 329 | 4.7% |
| ö | 311 | 4.4% |
| ü | 236 | 3.4% |
| ó | 220 | 3.1% |
| ä | 193 | 2.7% |
| ã | 71 | 1.0% |
| ú | 64 | 0.9% |
| Other values (12) | 137 | 1.9% |
dateIdentified
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 59 |
|---|---|
| Median length | 59 |
| Mean length | 59 |
| Min length | 59 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae, Dicotyledonae, Malpighiales, Violaceae, Violoideae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 | |
| dicotyledonae | 1 | |
| malpighiales | 1 | |
| violaceae | 1 | |
| violoideae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 8 | |
| i | 6 | |
| l | 6 | |
| o | 5 | |
| , | 4 | 6.8% |
| 4 | 6.8% | |
| c | 2 | 3.4% |
| d | 2 | 3.4% |
| V | 2 | 3.4% |
| Other values (10) | 12 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 46 | |
| Uppercase Letter | 5 | 8.5% |
| Other Punctuation | 4 | 6.8% |
| Space Separator | 4 | 6.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 8 | |
| i | 6 | |
| l | 6 | |
| o | 5 | |
| c | 2 | 4.3% |
| d | 2 | 4.3% |
| t | 2 | 4.3% |
| n | 2 | 4.3% |
| y | 1 | 2.2% |
| Other values (4) | 4 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 2 | |
| D | 1 | |
| M | 1 | |
| P | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 51 | |
| Common | 8 | 13.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 8 | |
| i | 6 | |
| l | 6 | |
| o | 5 | |
| c | 2 | 3.9% |
| d | 2 | 3.9% |
| V | 2 | 3.9% |
| t | 2 | 3.9% |
| n | 2 | 3.9% |
| Other values (8) | 8 |
Common
| Value | Count | Frequency (%) |
| , | 4 | |
| 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 59 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 8 | |
| e | 8 | |
| i | 6 | |
| l | 6 | |
| o | 5 | |
| , | 4 | 6.8% |
| 4 | 6.8% | |
| c | 2 | 3.4% |
| d | 2 | 3.4% |
| V | 2 | 3.4% |
| Other values (10) | 12 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Plantae |
|---|
| Value | Count | Frequency (%) |
| plantae | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
identificationVerificationStatus
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Tracheophyta |
|---|
| Value | Count | Frequency (%) |
| tracheophyta | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| T | 1 | |
| r | 1 | |
| c | 1 | |
| e | 1 | |
| o | 1 | |
| p | 1 | |
| y | 1 | |
| t | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| r | 1 | |
| c | 1 | |
| e | 1 | |
| o | 1 | |
| p | 1 | |
| y | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| T | 1 | |
| r | 1 | |
| c | 1 | |
| e | 1 | |
| o | 1 | |
| p | 1 | |
| y | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| T | 1 | |
| r | 1 | |
| c | 1 | |
| e | 1 | |
| o | 1 | |
| p | 1 | |
| y | 1 | |
| t | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Magnoliopsida |
|---|
| Value | Count | Frequency (%) |
| magnoliopsida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
taxonID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 12 |
| Mean length | 12 |
| Min length | 12 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Malpighiales |
|---|
| Value | Count | Frequency (%) |
| malpighiales | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| M | 1 | |
| p | 1 | |
| g | 1 | |
| h | 1 | |
| e | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11 | |
| Uppercase Letter | 1 | 8.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| p | 1 | |
| g | 1 | |
| h | 1 | |
| e | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| M | 1 | |
| p | 1 | |
| g | 1 | |
| h | 1 | |
| e | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| i | 2 | |
| M | 1 | |
| p | 1 | |
| g | 1 | |
| h | 1 | |
| e | 1 | |
| s | 1 |
| Distinct | 141149 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 3368 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 7 |
| Mean length | 7.000691347 |
| Min length | 1 |
Unique
| Unique | 52485 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | 2654944 |
|---|---|
| 2nd row | 2947270 |
| 3rd row | 10416230 |
| 4th row | 3687053 |
| 5th row | 7355530 |
| Value | Count | Frequency (%) |
| 7947184 | 4001 | 0.4% |
| 2655370 | 1415 | 0.1% |
| 6 | 1163 | 0.1% |
| 3219107 | 1082 | 0.1% |
| 5426909 | 1064 | 0.1% |
| 2702678 | 1008 | 0.1% |
| 5426949 | 994 | 0.1% |
| 2654909 | 868 | 0.1% |
| 2655497 | 809 | 0.1% |
| 5426932 | 760 | 0.1% |
| Other values (141139) | 971870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 | |
| Other values (7) | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6895910 | |
| Lowercase Letter | 8 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| e | 2 | |
| i | 1 | |
| o | 1 | |
| l | 1 | |
| c | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6895910 | |
| Latin | 9 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 |
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| e | 2 | |
| V | 1 | |
| i | 1 | |
| o | 1 | |
| l | 1 | |
| c | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6895919 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 | |
| Other values (7) | 9 | < 0.1% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rinorea |
|---|
| Value | Count | Frequency (%) |
| rinorea | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
taxonConceptID
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Rinorea |
|---|
| Value | Count | Frequency (%) |
| rinorea | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 | |
| Uppercase Letter | 1 | 14.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1 | |
| i | 1 | |
| n | 1 | |
| o | 1 | |
| r | 1 | |
| e | 1 | |
| a | 1 |
scientificName
Text
| Distinct | 171484 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 145 |
|---|---|
| Median length | 90 |
| Mean length | 31.1400224 |
| Min length | 5 |
Unique
| Unique | 76155 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | Lithothamnion calcareum (Pallas) Areschoug |
|---|---|
| 2nd row | Amicia glandulosa Kunth |
| 3rd row | Tripogandra glandulosa (Seub.) Rohweder |
| 4th row | Connarus steyermarkii Prance |
| 5th row | Trichoneura grandiglumis (Nees) Ekman |
| Value | Count | Frequency (%) |
| l | 155063 | 4.1% |
| 123403 | 3.2% | |
| ex | 71536 | 1.9% |
| var | 42961 | 1.1% |
| kunth | 25715 | 0.7% |
| dc | 25369 | 0.7% |
| benth | 22482 | 0.6% |
| a.gray | 22453 | 0.6% |
| subsp | 20360 | 0.5% |
| sw | 19134 | 0.5% |
| Other values (72296) | 3270675 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2810752 | 9.1% | |
| a | 2784580 | 9.0% |
| i | 2163680 | 7.0% |
| e | 1918469 | 6.2% |
| r | 1705055 | 5.5% |
| l | 1505257 | 4.9% |
| o | 1502652 | 4.9% |
| n | 1425654 | 4.6% |
| s | 1389295 | 4.5% |
| . | 1385249 | 4.5% |
| Other values (108) | 12188124 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22543815 | |
| Uppercase Letter | 2972512 | 9.7% |
| Space Separator | 2810752 | 9.1% |
| Other Punctuation | 1548996 | 5.0% |
| Open Punctuation | 381562 | 1.2% |
| Close Punctuation | 381562 | 1.2% |
| Decimal Number | 125780 | 0.4% |
| Dash Punctuation | 11127 | < 0.1% |
| Math Symbol | 2640 | < 0.1% |
| Connector Punctuation | 21 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2784580 | |
| i | 2163680 | 9.6% |
| e | 1918469 | 8.5% |
| r | 1705055 | 7.6% |
| l | 1505257 | 6.7% |
| o | 1502652 | 6.7% |
| n | 1425654 | 6.3% |
| s | 1389295 | 6.2% |
| u | 1233185 | 5.5% |
| t | 1179720 | 5.2% |
| Other values (51) | 5736268 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 300531 | 10.1% |
| S | 279166 | 9.4% |
| C | 270055 | 9.1% |
| P | 213524 | 7.2% |
| A | 207708 | 7.0% |
| M | 200072 | 6.7% |
| B | 195748 | 6.6% |
| H | 174615 | 5.9% |
| R | 140797 | 4.7% |
| D | 140382 | 4.7% |
| Other values (27) | 849914 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 36281 | |
| 8 | 25539 | |
| 9 | 16579 | |
| 2 | 7723 | 6.1% |
| 3 | 7598 | 6.0% |
| 7 | 7534 | 6.0% |
| 0 | 7406 | 5.9% |
| 4 | 6583 | 5.2% |
| 6 | 5657 | 4.5% |
| 5 | 4880 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1385249 | |
| & | 123403 | 8.0% |
| , | 38852 | 2.5% |
| ' | 1492 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2810752 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 381562 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 381562 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 11127 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 2640 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 21 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25516327 | |
| Common | 5262440 | 17.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2784580 | 10.9% |
| i | 2163680 | 8.5% |
| e | 1918469 | 7.5% |
| r | 1705055 | 6.7% |
| l | 1505257 | 5.9% |
| o | 1502652 | 5.9% |
| n | 1425654 | 5.6% |
| s | 1389295 | 5.4% |
| u | 1233185 | 4.8% |
| t | 1179720 | 4.6% |
| Other values (88) | 8708780 |
Common
| Value | Count | Frequency (%) |
| 2810752 | ||
| . | 1385249 | |
| ( | 381562 | 7.3% |
| ) | 381562 | 7.3% |
| & | 123403 | 2.3% |
| , | 38852 | 0.7% |
| 1 | 36281 | 0.7% |
| 8 | 25539 | 0.5% |
| 9 | 16579 | 0.3% |
| - | 11127 | 0.2% |
| Other values (10) | 51534 | 1.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30731098 | |
| None | 47669 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2810752 | 9.1% | |
| a | 2784580 | 9.1% |
| i | 2163680 | 7.0% |
| e | 1918469 | 6.2% |
| r | 1705055 | 5.5% |
| l | 1505257 | 4.9% |
| o | 1502652 | 4.9% |
| n | 1425654 | 4.6% |
| s | 1389295 | 4.5% |
| . | 1385249 | 4.5% |
| Other values (61) | 12140455 |
None
| Value | Count | Frequency (%) |
| ü | 15283 | |
| é | 9353 | |
| ö | 6844 | |
| × | 2640 | 5.5% |
| ä | 2495 | 5.2% |
| á | 2384 | 5.0% |
| Á | 1742 | 3.7% |
| ø | 1175 | 2.5% |
| è | 888 | 1.9% |
| ó | 870 | 1.8% |
| Other values (37) | 3995 | 8.4% |
parentNameUsage
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | pubiflora |
|---|
| Value | Count | Frequency (%) |
| pubiflora | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 9 |
|---|---|
| Median length | 9 |
| Mean length | 9 |
| Min length | 9 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | pubiflora |
|---|
| Value | Count | Frequency (%) |
| pubiflora | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| p | 1 | |
| u | 1 | |
| b | 1 | |
| i | 1 | |
| f | 1 | |
| l | 1 | |
| o | 1 | |
| r | 1 | |
| a | 1 |
namePublishedIn
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | VARIETY |
|---|
| Value | Count | Frequency (%) |
| variety | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| V | 1 | |
| A | 1 | |
| R | 1 | |
| I | 1 | |
| E | 1 | |
| T | 1 | |
| Y | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 1 | |
| A | 1 | |
| R | 1 | |
| I | 1 | |
| E | 1 | |
| T | 1 | |
| Y | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| V | 1 | |
| A | 1 | |
| R | 1 | |
| I | 1 | |
| E | 1 | |
| T | 1 | |
| Y | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| V | 1 | |
| A | 1 | |
| R | 1 | |
| I | 1 | |
| E | 1 | |
| T | 1 | |
| Y | 1 |
| Distinct | 1871 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 3060 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 106 |
|---|---|
| Median length | 83 |
| Mean length | 55.78930767 |
| Min length | 6 |
Unique
| Unique | 247 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Plantae, Rhodophyta, Corallinales, Lithothamniaceae |
|---|---|
| 2nd row | Plantae, Dicotyledonae, Fabales, Fabaceae, Papilionoideae |
| 3rd row | Plantae, Monocotyledonae, Commelinales, Commelinaceae |
| 4th row | Plantae, Dicotyledonae, Oxalidales, Connaraceae |
| 5th row | Plantae, Monocotyledonae, Poales, Poaceae, Chloridoideae |
| Value | Count | Frequency (%) |
| plantae | 906960 | 19.6% |
| dicotyledonae | 565444 | 12.2% |
| monocotyledonae | 198988 | 4.3% |
| poales | 153711 | 3.3% |
| poaceae | 110119 | 2.4% |
| asterales | 83265 | 1.8% |
| asteraceae | 78409 | 1.7% |
| asteroideae | 62020 | 1.3% |
| pteridophyte | 60609 | 1.3% |
| lamiales | 58285 | 1.3% |
| Other values (1989) | 2357982 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 7742674 | |
| e | 7698551 | |
| o | 4128767 | 7.5% |
| 3650450 | 6.6% | |
| , | 3626340 | 6.6% |
| l | 3562838 | 6.5% |
| n | 2796731 | 5.1% |
| t | 2746790 | 5.0% |
| i | 2727025 | 5.0% |
| c | 2484340 | 4.5% |
| Other values (50) | 13807042 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 43041381 | |
| Uppercase Letter | 4611679 | 8.4% |
| Space Separator | 3650450 | 6.6% |
| Other Punctuation | 3630777 | 6.6% |
| Close Punctuation | 18607 | < 0.1% |
| Open Punctuation | 18607 | < 0.1% |
| Dash Punctuation | 47 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 7742674 | |
| e | 7698551 | |
| o | 4128767 | |
| l | 3562838 | |
| n | 2796731 | 6.5% |
| t | 2746790 | 6.4% |
| i | 2727025 | 6.3% |
| c | 2484340 | 5.8% |
| s | 1741258 | 4.0% |
| d | 1740291 | 4.0% |
| Other values (17) | 5672116 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1556974 | |
| D | 610750 | 13.2% |
| A | 433977 | 9.4% |
| M | 401816 | 8.7% |
| C | 333336 | 7.2% |
| L | 197314 | 4.3% |
| F | 191584 | 4.2% |
| R | 167750 | 3.6% |
| B | 159640 | 3.5% |
| S | 141071 | 3.1% |
| Other values (16) | 417467 | 9.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 3626340 | |
| . | 4436 | 0.1% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3650450 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 18607 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 18607 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 47 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 47653060 | |
| Common | 7318488 | 13.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 7742674 | |
| e | 7698551 | |
| o | 4128767 | |
| l | 3562838 | 7.5% |
| n | 2796731 | 5.9% |
| t | 2746790 | 5.8% |
| i | 2727025 | 5.7% |
| c | 2484340 | 5.2% |
| s | 1741258 | 3.7% |
| d | 1740291 | 3.7% |
| Other values (43) | 10283795 |
Common
| Value | Count | Frequency (%) |
| 3650450 | ||
| , | 3626340 | |
| ) | 18607 | 0.3% |
| ( | 18607 | 0.3% |
| . | 4436 | 0.1% |
| - | 47 | < 0.1% |
| ? | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 54971425 | |
| None | 123 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 7742674 | |
| e | 7698551 | |
| o | 4128767 | 7.5% |
| 3650450 | 6.6% | |
| , | 3626340 | 6.6% |
| l | 3562838 | 6.5% |
| n | 2796731 | 5.1% |
| t | 2746790 | 5.0% |
| i | 2727025 | 5.0% |
| c | 2484340 | 4.5% |
| Other values (49) | 13806919 |
None
| Value | Count | Frequency (%) |
| ö | 123 |
kingdom
Text
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 7 |
| Mean length | 6.971155373 |
| Min length | 5 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Plantae |
|---|---|
| 2nd row | Plantae |
| 3rd row | Plantae |
| 4th row | Plantae |
| 5th row | Plantae |
| Value | Count | Frequency (%) |
| plantae | 907311 | |
| fungi | 48945 | 4.9% |
| chromista | 17041 | 1.7% |
| bacteria | 11701 | 1.2% |
| incertae | 3366 | 0.3% |
| sedis | 3366 | 0.3% |
| protozoa | 31 | < 0.1% |
| animalia | 4 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1858470 | |
| n | 959626 | |
| t | 939450 | |
| e | 929110 | |
| P | 907342 | |
| l | 907315 | |
| i | 84427 | 1.2% |
| F | 48945 | 0.7% |
| u | 48945 | 0.7% |
| g | 48945 | 0.7% |
| Other values (12) | 157708 | 2.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5901884 | |
| Uppercase Letter | 985033 | 14.3% |
| Space Separator | 3366 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1858470 | |
| n | 959626 | |
| t | 939450 | |
| e | 929110 | |
| l | 907315 | |
| i | 84427 | 1.4% |
| u | 48945 | 0.8% |
| g | 48945 | 0.8% |
| r | 32139 | 0.5% |
| s | 23773 | 0.4% |
| Other values (6) | 69684 | 1.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 907342 | |
| F | 48945 | 5.0% |
| C | 17041 | 1.7% |
| B | 11701 | 1.2% |
| A | 4 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 3366 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6886917 | |
| Common | 3366 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1858470 | |
| n | 959626 | |
| t | 939450 | |
| e | 929110 | |
| P | 907342 | |
| l | 907315 | |
| i | 84427 | 1.2% |
| F | 48945 | 0.7% |
| u | 48945 | 0.7% |
| g | 48945 | 0.7% |
| Other values (11) | 154342 | 2.2% |
Common
| Value | Count | Frequency (%) |
| 3366 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6890283 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1858470 | |
| n | 959626 | |
| t | 939450 | |
| e | 929110 | |
| P | 907342 | |
| l | 907315 | |
| i | 84427 | 1.2% |
| F | 48945 | 0.7% |
| u | 48945 | 0.7% |
| g | 48945 | 0.7% |
| Other values (12) | 157708 | 2.3% |
phylum
Text
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4754 |
| Missing (%) | 0.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 12 |
| Mean length | 11.72722051 |
| Min length | 7 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Rhodophyta |
|---|---|
| 2nd row | Tracheophyta |
| 3rd row | Tracheophyta |
| 4th row | Tracheophyta |
| 5th row | Tracheophyta |
| Value | Count | Frequency (%) |
| tracheophyta | 830617 | |
| ascomycota | 48276 | 4.9% |
| bryophyta | 32695 | 3.3% |
| rhodophyta | 26385 | 2.7% |
| ochrophyta | 15149 | 1.5% |
| cyanobacteria | 11694 | 1.2% |
| chlorophyta | 9268 | 0.9% |
| marchantiophyta | 5937 | 0.6% |
| myzozoa | 1887 | 0.2% |
| charophyta | 1126 | 0.1% |
| Other values (14) | 614 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1851109 | |
| h | 1809899 | |
| o | 1070187 | |
| y | 1016309 | |
| t | 987912 | |
| c | 960529 | |
| p | 921305 | |
| r | 906622 | |
| e | 842467 | |
| T | 830618 | |
| Other values (22) | 338500 | 2.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10551802 | |
| Uppercase Letter | 983655 | 8.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1851109 | |
| h | 1809899 | |
| o | 1070187 | |
| y | 1016309 | |
| t | 987912 | |
| c | 960529 | |
| p | 921305 | |
| r | 906622 | |
| e | 842467 | |
| m | 48739 | 0.5% |
| Other values (10) | 136724 | 1.3% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 830618 | |
| A | 48406 | 4.9% |
| B | 33146 | 3.4% |
| R | 26385 | 2.7% |
| C | 22097 | 2.2% |
| O | 15149 | 1.5% |
| M | 7826 | 0.8% |
| E | 19 | < 0.1% |
| P | 5 | < 0.1% |
| F | 2 | < 0.1% |
| Other values (2) | 2 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11535457 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1851109 | |
| h | 1809899 | |
| o | 1070187 | |
| y | 1016309 | |
| t | 987912 | |
| c | 960529 | |
| p | 921305 | |
| r | 906622 | |
| e | 842467 | |
| T | 830618 | |
| Other values (22) | 338500 | 2.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 11535457 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1851109 | |
| h | 1809899 | |
| o | 1070187 | |
| y | 1016309 | |
| t | 987912 | |
| c | 960529 | |
| p | 921305 | |
| r | 906622 | |
| e | 842467 | |
| T | 830618 | |
| Other values (22) | 338500 | 2.9% |
class
Text
| Distinct | 68 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5481 |
| Missing (%) | 0.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 19 |
|---|---|
| Median length | 13 |
| Mean length | 12.51019767 |
| Min length | 6 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Florideophyceae |
|---|---|
| 2nd row | Magnoliopsida |
| 3rd row | Liliopsida |
| 4th row | Magnoliopsida |
| 5th row | Liliopsida |
| Value | Count | Frequency (%) |
| magnoliopsida | 565617 | |
| liliopsida | 199036 | 20.2% |
| polypodiopsida | 54963 | 5.6% |
| lecanoromycetes | 44421 | 4.5% |
| bryopsida | 29396 | 3.0% |
| florideophyceae | 25770 | 2.6% |
| cyanobacteriia | 11282 | 1.1% |
| bacillariophyceae | 8448 | 0.9% |
| ulvophyceae | 8422 | 0.9% |
| phaeophyceae | 6544 | 0.7% |
| Other values (58) | 29022 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1977833 | |
| o | 1746862 | |
| a | 1601551 | |
| p | 985390 | |
| d | 957361 | |
| s | 918228 | |
| l | 873356 | |
| n | 649522 | 5.3% |
| g | 573852 | 4.7% |
| M | 566314 | 4.6% |
| Other values (32) | 1446267 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11313615 | |
| Uppercase Letter | 982921 | 8.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1977833 | |
| o | 1746862 | |
| a | 1601551 | |
| p | 985390 | |
| d | 957361 | |
| s | 918228 | |
| l | 873356 | |
| n | 649522 | 5.7% |
| g | 573852 | 5.1% |
| e | 300397 | 2.7% |
| Other values (13) | 729263 | 6.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 566314 | |
| L | 249439 | |
| P | 67683 | 6.9% |
| B | 38181 | 3.9% |
| F | 25770 | 2.6% |
| C | 13423 | 1.4% |
| U | 8422 | 0.9% |
| J | 5240 | 0.5% |
| D | 2331 | 0.2% |
| A | 2061 | 0.2% |
| Other values (9) | 4057 | 0.4% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12296536 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1977833 | |
| o | 1746862 | |
| a | 1601551 | |
| p | 985390 | |
| d | 957361 | |
| s | 918228 | |
| l | 873356 | |
| n | 649522 | 5.3% |
| g | 573852 | 4.7% |
| M | 566314 | 4.6% |
| Other values (32) | 1446267 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12296536 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1977833 | |
| o | 1746862 | |
| a | 1601551 | |
| p | 985390 | |
| d | 957361 | |
| s | 918228 | |
| l | 873356 | |
| n | 649522 | 5.3% |
| g | 573852 | 4.7% |
| M | 566314 | 4.6% |
| Other values (32) | 1446267 |
order
Text
Missing 
| Distinct | 357 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10135 |
| Missing (%) | 1.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 21 |
|---|---|
| Median length | 18 |
| Mean length | 9.357003763 |
| Min length | 6 |
Unique
| Unique | 38 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Corallinales |
|---|---|
| 2nd row | Fabales |
| 3rd row | Commelinales |
| 4th row | Oxalidales |
| 5th row | Poales |
| Value | Count | Frequency (%) |
| poales | 153750 | 15.7% |
| asterales | 83320 | 8.5% |
| lamiales | 58318 | 6.0% |
| fabales | 55218 | 5.6% |
| malpighiales | 46323 | 4.7% |
| polypodiales | 42295 | 4.3% |
| gentianales | 39541 | 4.0% |
| myrtales | 34933 | 3.6% |
| caryophyllales | 32482 | 3.3% |
| rosales | 28326 | 2.9% |
| Other values (347) | 403761 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1510235 | |
| l | 1284489 | |
| e | 1227651 | |
| s | 1170802 | |
| i | 502041 | 5.5% |
| o | 443593 | 4.8% |
| r | 374076 | 4.1% |
| n | 276338 | 3.0% |
| t | 238442 | 2.6% |
| P | 224898 | 2.5% |
| Other values (39) | 1901083 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8175381 | |
| Uppercase Letter | 978267 | 10.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1510235 | |
| l | 1284489 | |
| e | 1227651 | |
| s | 1170802 | |
| i | 502041 | 6.1% |
| o | 443593 | 5.4% |
| r | 374076 | 4.6% |
| n | 276338 | 3.4% |
| t | 238442 | 2.9% |
| p | 219045 | 2.7% |
| Other values (15) | 928669 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 224898 | |
| A | 127170 | |
| M | 105522 | |
| L | 101257 | |
| C | 84594 | 8.6% |
| F | 64567 | 6.6% |
| S | 54625 | 5.6% |
| G | 49863 | 5.1% |
| R | 40816 | 4.2% |
| E | 31238 | 3.2% |
| Other values (14) | 93717 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9153648 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1510235 | |
| l | 1284489 | |
| e | 1227651 | |
| s | 1170802 | |
| i | 502041 | 5.5% |
| o | 443593 | 4.8% |
| r | 374076 | 4.1% |
| n | 276338 | 3.0% |
| t | 238442 | 2.6% |
| P | 224898 | 2.5% |
| Other values (39) | 1901083 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9153648 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1510235 | |
| l | 1284489 | |
| e | 1227651 | |
| s | 1170802 | |
| i | 502041 | 5.5% |
| o | 443593 | 4.8% |
| r | 374076 | 4.1% |
| n | 276338 | 3.0% |
| t | 238442 | 2.6% |
| P | 224898 | 2.5% |
| Other values (39) | 1901083 |
superfamily
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 36 |
| Min length | 36 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| - | 4 | |
| 2 | 3 | |
| b | 3 | |
| 4 | 3 | |
| 8 | 2 | 5.6% |
| 3 | 2 | 5.6% |
| 5 | 2 | 5.6% |
| 9 | 2 | 5.6% |
| Other values (6) | 7 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 18 | |
| Lowercase Letter | 14 | |
| Dash Punctuation | 4 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 3 | |
| 4 | 3 | |
| 8 | 2 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 1 | 1 | 5.6% |
| 7 | 1 | 5.6% |
| 0 | 1 | 5.6% |
| 6 | 1 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 14 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 4 | |
| 2 | 3 | |
| 4 | 3 | |
| 8 | 2 | |
| 3 | 2 | |
| 5 | 2 | |
| 9 | 2 | |
| 1 | 1 | 4.5% |
| 7 | 1 | 4.5% |
| 0 | 1 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 36 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| - | 4 | |
| 2 | 3 | |
| b | 3 | |
| 4 | 3 | |
| 8 | 2 | 5.6% |
| 3 | 2 | 5.6% |
| 5 | 2 | 5.6% |
| 9 | 2 | 5.6% |
| Other values (6) | 7 |
family
Text
Missing 
| Distinct | 1293 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 10432 |
| Missing (%) | 1.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 23 |
|---|---|
| Median length | 20 |
| Mean length | 10.76219925 |
| Min length | 2 |
Unique
| Unique | 122 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Hapalidiaceae |
|---|---|
| 2nd row | Fabaceae |
| 3rd row | Commelinaceae |
| 4th row | Connaraceae |
| 5th row | Poaceae |
| Value | Count | Frequency (%) |
| poaceae | 110118 | 11.3% |
| asteraceae | 78427 | 8.0% |
| fabaceae | 51638 | 5.3% |
| cyperaceae | 30498 | 3.1% |
| rubiaceae | 26201 | 2.7% |
| melastomataceae | 16271 | 1.7% |
| malvaceae | 14761 | 1.5% |
| rosaceae | 14530 | 1.5% |
| parmeliaceae | 14370 | 1.5% |
| lamiaceae | 13720 | 1.4% |
| Other values (1283) | 607436 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2413737 | |
| e | 2328328 | |
| c | 1165771 | |
| i | 467905 | 4.4% |
| r | 452986 | 4.3% |
| o | 448954 | 4.3% |
| l | 342205 | 3.3% |
| t | 293841 | 2.8% |
| n | 274374 | 2.6% |
| s | 219276 | 2.1% |
| Other values (42) | 2117731 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 9547099 | |
| Uppercase Letter | 977990 | 9.3% |
| Connector Punctuation | 19 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2413737 | |
| e | 2328328 | |
| c | 1165771 | |
| i | 467905 | 4.9% |
| r | 452986 | 4.7% |
| o | 448954 | 4.7% |
| l | 342205 | 3.6% |
| t | 293841 | 3.1% |
| n | 274374 | 2.9% |
| s | 219276 | 2.3% |
| Other values (16) | 1139722 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 210065 | |
| A | 147452 | |
| C | 108066 | |
| M | 66499 | 6.8% |
| R | 64279 | 6.6% |
| F | 58492 | 6.0% |
| S | 51973 | 5.3% |
| L | 43467 | 4.4% |
| B | 36909 | 3.8% |
| O | 31591 | 3.2% |
| Other values (15) | 159197 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 19 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10525089 | |
| Common | 19 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2413737 | |
| e | 2328328 | |
| c | 1165771 | |
| i | 467905 | 4.4% |
| r | 452986 | 4.3% |
| o | 448954 | 4.3% |
| l | 342205 | 3.3% |
| t | 293841 | 2.8% |
| n | 274374 | 2.6% |
| s | 219276 | 2.1% |
| Other values (41) | 2117712 |
Common
| Value | Count | Frequency (%) |
| _ | 19 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10525108 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2413737 | |
| e | 2328328 | |
| c | 1165771 | |
| i | 467905 | 4.4% |
| r | 452986 | 4.3% |
| o | 448954 | 4.3% |
| l | 342205 | 3.3% |
| t | 293841 | 2.8% |
| n | 274374 | 2.6% |
| s | 219276 | 2.1% |
| Other values (42) | 2117731 |
subfamily
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 24 |
| Min length | 24 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 2024-12-02T13:57:09.776Z |
|---|
| Value | Count | Frequency (%) |
| 2024-12-02t13:57:09.776z | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 3 | |
| 7 | 3 | |
| - | 2 | |
| 1 | 2 | |
| : | 2 | |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| 3 | 1 | 4.2% |
| 5 | 1 | 4.2% |
| Other values (4) | 4 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17 | |
| Other Punctuation | 3 | 12.5% |
| Dash Punctuation | 2 | 8.3% |
| Uppercase Letter | 2 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 3 | |
| 7 | 3 | |
| 1 | 2 | |
| 4 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 5 | 1 | 5.9% |
| 9 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 22 | |
| Latin | 2 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 3 | |
| 7 | 3 | |
| - | 2 | |
| 1 | 2 | |
| : | 2 | |
| 4 | 1 | 4.5% |
| 3 | 1 | 4.5% |
| 5 | 1 | 4.5% |
| 9 | 1 | 4.5% |
| Other values (2) | 2 |
Latin
| Value | Count | Frequency (%) |
| T | 1 | |
| Z | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4 | |
| 0 | 3 | |
| 7 | 3 | |
| - | 2 | |
| 1 | 2 | |
| : | 2 | |
| 4 | 1 | 4.2% |
| T | 1 | 4.2% |
| 3 | 1 | 4.2% |
| 5 | 1 | 4.2% |
| Other values (4) | 4 |
tribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 450.0 |
|---|
| Value | Count | Frequency (%) |
| 450.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 4 | 1 | |
| 5 | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 | |
| Other Punctuation | 1 | 20.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 4 | 1 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 5 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 4 | 1 | |
| 5 | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 4 | 1 | |
| 5 | 1 | |
| . | 1 |
subtribe
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 50.0 |
|---|
| Value | Count | Frequency (%) |
| 50.0 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| . | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3 | |
| Other Punctuation | 1 | 25.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| . | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 2 | |
| 5 | 1 | |
| . | 1 |
genus
Text
Missing 
| Distinct | 14195 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 15345 |
| Missing (%) | 1.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 22 |
|---|---|
| Median length | 19 |
| Mean length | 8.8481127 |
| Min length | 2 |
Unique
| Unique | 2072 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | Phymatolithon |
|---|---|
| 2nd row | Amicia |
| 3rd row | Callisia |
| 4th row | Connarus |
| 5th row | Trichoneura |
| Value | Count | Frequency (%) |
| carex | 12742 | 1.3% |
| miconia | 8772 | 0.9% |
| cladonia | 6873 | 0.7% |
| poa | 6684 | 0.7% |
| cyperus | 6044 | 0.6% |
| paspalum | 5820 | 0.6% |
| solanum | 5538 | 0.6% |
| eragrostis | 5205 | 0.5% |
| dichanthelium | 4464 | 0.5% |
| asplenium | 4297 | 0.4% |
| Other values (14184) | 906618 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1065944 | 12.4% |
| i | 802101 | 9.3% |
| o | 610885 | 7.1% |
| e | 599768 | 7.0% |
| r | 564074 | 6.6% |
| l | 476062 | 5.5% |
| s | 450892 | 5.2% |
| n | 446312 | 5.2% |
| u | 428361 | 5.0% |
| t | 360702 | 4.2% |
| Other values (43) | 2804617 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7636370 | |
| Uppercase Letter | 973079 | 11.3% |
| Dash Punctuation | 269 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1065944 | |
| i | 802101 | |
| o | 610885 | 8.0% |
| e | 599768 | 7.9% |
| r | 564074 | 7.4% |
| l | 476062 | 6.2% |
| s | 450892 | 5.9% |
| n | 446312 | 5.8% |
| u | 428361 | 5.6% |
| t | 360702 | 4.7% |
| Other values (16) | 1831269 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 136320 | |
| P | 129899 | |
| S | 98619 | |
| A | 86400 | 8.9% |
| M | 64229 | 6.6% |
| E | 54716 | 5.6% |
| L | 48991 | 5.0% |
| D | 46834 | 4.8% |
| H | 40071 | 4.1% |
| B | 40048 | 4.1% |
| Other values (16) | 226952 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 269 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8609449 | |
| Common | 269 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1065944 | 12.4% |
| i | 802101 | 9.3% |
| o | 610885 | 7.1% |
| e | 599768 | 7.0% |
| r | 564074 | 6.6% |
| l | 476062 | 5.5% |
| s | 450892 | 5.2% |
| n | 446312 | 5.2% |
| u | 428361 | 5.0% |
| t | 360702 | 4.2% |
| Other values (42) | 2804348 |
Common
| Value | Count | Frequency (%) |
| - | 269 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8609718 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1065944 | 12.4% |
| i | 802101 | 9.3% |
| o | 610885 | 7.1% |
| e | 599768 | 7.0% |
| r | 564074 | 6.6% |
| l | 476062 | 5.5% |
| s | 450892 | 5.2% |
| n | 446312 | 5.2% |
| u | 428361 | 5.0% |
| t | 360702 | 4.2% |
| Other values (43) | 2804617 |
genericName
Text
Missing 
| Distinct | 15150 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 15400 |
| Missing (%) | 1.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 17 |
| Mean length | 8.785601674 |
| Min length | 2 |
Unique
| Unique | 2933 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | Lithothamnion |
|---|---|
| 2nd row | Amicia |
| 3rd row | Tripogandra |
| 4th row | Connarus |
| 5th row | Trichoneura |
| Value | Count | Frequency (%) |
| carex | 12732 | 1.3% |
| poa | 6687 | 0.7% |
| cyperus | 6038 | 0.6% |
| cladonia | 5891 | 0.6% |
| paspalum | 5802 | 0.6% |
| miconia | 5466 | 0.6% |
| solanum | 5416 | 0.6% |
| eragrostis | 5200 | 0.5% |
| asplenium | 4423 | 0.5% |
| dichanthelium | 4230 | 0.4% |
| Other values (15139) | 911117 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1056368 | 12.4% |
| i | 790190 | 9.2% |
| o | 599730 | 7.0% |
| e | 592542 | 6.9% |
| r | 561767 | 6.6% |
| l | 470384 | 5.5% |
| s | 445526 | 5.2% |
| n | 443561 | 5.2% |
| u | 432434 | 5.1% |
| t | 358837 | 4.2% |
| Other values (44) | 2797069 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7575383 | |
| Uppercase Letter | 973005 | 11.4% |
| Dash Punctuation | 20 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1056368 | |
| i | 790190 | |
| o | 599730 | 7.9% |
| e | 592542 | 7.8% |
| r | 561767 | 7.4% |
| l | 470384 | 6.2% |
| s | 445526 | 5.9% |
| n | 443561 | 5.9% |
| u | 432434 | 5.7% |
| t | 358837 | 4.7% |
| Other values (17) | 1824044 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 139989 | |
| P | 128503 | |
| S | 96475 | |
| A | 88126 | 9.1% |
| M | 61343 | 6.3% |
| E | 52472 | 5.4% |
| L | 52006 | 5.3% |
| D | 45953 | 4.7% |
| B | 42140 | 4.3% |
| H | 40016 | 4.1% |
| Other values (16) | 225982 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 20 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8548388 | |
| Common | 20 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1056368 | 12.4% |
| i | 790190 | 9.2% |
| o | 599730 | 7.0% |
| e | 592542 | 6.9% |
| r | 561767 | 6.6% |
| l | 470384 | 5.5% |
| s | 445526 | 5.2% |
| n | 443561 | 5.2% |
| u | 432434 | 5.1% |
| t | 358837 | 4.2% |
| Other values (43) | 2797049 |
Common
| Value | Count | Frequency (%) |
| - | 20 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8548392 | |
| None | 16 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1056368 | 12.4% |
| i | 790190 | 9.2% |
| o | 599730 | 7.0% |
| e | 592542 | 6.9% |
| r | 561767 | 6.6% |
| l | 470384 | 5.5% |
| s | 445526 | 5.2% |
| n | 443561 | 5.2% |
| u | 432434 | 5.1% |
| t | 358837 | 4.2% |
| Other values (43) | 2797053 |
None
| Value | Count | Frequency (%) |
| ë | 16 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 48 |
|---|---|
| Median length | 48 |
| Mean length | 48 |
| Min length | 48 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
|---|
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 5 | |
| _ | 5 | |
| C | 4 | |
| U | 4 | |
| E | 4 | |
| N | 4 | |
| I | 4 | |
| O | 3 | 6.2% |
| T | 3 | 6.2% |
| D | 3 | 6.2% |
| Other values (6) | 9 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 43 | |
| Connector Punctuation | 5 | 10.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 5 | |
| C | 4 | |
| U | 4 | |
| E | 4 | |
| N | 4 | |
| I | 4 | |
| O | 3 | |
| T | 3 | |
| D | 3 | |
| S | 2 | 4.7% |
| Other values (5) | 7 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 43 | |
| Common | 5 | 10.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 5 | |
| C | 4 | |
| U | 4 | |
| E | 4 | |
| N | 4 | |
| I | 4 | |
| O | 3 | |
| T | 3 | |
| D | 3 | |
| S | 2 | 4.7% |
| Other values (5) | 7 |
Common
| Value | Count | Frequency (%) |
| _ | 5 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 48 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 5 | |
| _ | 5 | |
| C | 4 | |
| U | 4 | |
| E | 4 | |
| N | 4 | |
| I | 4 | |
| O | 3 | 6.2% |
| T | 3 | 6.2% |
| D | 3 | 6.2% |
| Other values (6) | 9 |
specificEpithet
Text
Missing 
| Distinct | 44923 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 75483 |
| Missing (%) | 7.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 22 |
| Mean length | 9.15062344 |
| Min length | 3 |
Unique
| Unique | 14847 ? |
|---|---|
| Unique (%) | 1.6% |
Sample
| 1st row | calcareum |
|---|---|
| 2nd row | glandulosa |
| 3rd row | glandulosa |
| 4th row | steyermarkii |
| 5th row | grandiglumis |
| Value | Count | Frequency (%) |
| canadensis | 2613 | 0.3% |
| guianensis | 2604 | 0.3% |
| americana | 2509 | 0.3% |
| latifolia | 2449 | 0.3% |
| parviflora | 2235 | 0.2% |
| repens | 2200 | 0.2% |
| gracilis | 2040 | 0.2% |
| occidentalis | 2004 | 0.2% |
| indica | 1946 | 0.2% |
| pubescens | 1937 | 0.2% |
| Other values (44913) | 890382 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 1131933 | |
| i | 964476 | |
| s | 606683 | 7.3% |
| e | 594963 | 7.1% |
| r | 547391 | 6.6% |
| l | 544114 | 6.5% |
| n | 520530 | 6.2% |
| u | 490822 | 5.9% |
| o | 487476 | 5.8% |
| t | 439575 | 5.3% |
| Other values (22) | 2025815 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 8349817 | |
| Dash Punctuation | 3959 | < 0.1% |
| Uppercase Letter | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1131933 | |
| i | 964476 | |
| s | 606683 | 7.3% |
| e | 594963 | 7.1% |
| r | 547391 | 6.6% |
| l | 544114 | 6.5% |
| n | 520530 | 6.2% |
| u | 490822 | 5.9% |
| o | 487476 | 5.8% |
| t | 439575 | 5.3% |
| Other values (19) | 2021854 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 1 | |
| I | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3959 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8349819 | |
| Common | 3959 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1131933 | |
| i | 964476 | |
| s | 606683 | 7.3% |
| e | 594963 | 7.1% |
| r | 547391 | 6.6% |
| l | 544114 | 6.5% |
| n | 520530 | 6.2% |
| u | 490822 | 5.9% |
| o | 487476 | 5.8% |
| t | 439575 | 5.3% |
| Other values (21) | 2021856 |
Common
| Value | Count | Frequency (%) |
| - | 3959 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8353755 | |
| None | 23 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 1131933 | |
| i | 964476 | |
| s | 606683 | 7.3% |
| e | 594963 | 7.1% |
| r | 547391 | 6.6% |
| l | 544114 | 6.5% |
| n | 520530 | 6.2% |
| u | 490822 | 5.9% |
| o | 487476 | 5.8% |
| t | 439575 | 5.3% |
| Other values (19) | 2025792 |
None
| Value | Count | Frequency (%) |
| ï | 15 | |
| ë | 6 | 26.1% |
| ü | 2 | 8.7% |
Missing 
| Distinct | 6984 |
|---|---|
| Distinct (%) | 10.8% |
| Missing | 923675 |
| Missing (%) | 93.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 25 |
|---|---|
| Median length | 19 |
| Mean length | 9.201986806 |
| Min length | 4 |
Unique
| Unique | 2531 ? |
|---|---|
| Unique (%) | 3.9% |
Sample
| 1st row | oxyphylla |
|---|---|
| 2nd row | subalpinum |
| 3rd row | pubescens |
| 4th row | hirsuta |
| 5th row | crispa |
| Value | Count | Frequency (%) |
| acuminatum | 942 | 1.5% |
| pubescens | 386 | 0.6% |
| secunda | 352 | 0.5% |
| dichotomum | 328 | 0.5% |
| gracilis | 322 | 0.5% |
| americana | 321 | 0.5% |
| angustifolia | 270 | 0.4% |
| glauca | 264 | 0.4% |
| occidentalis | 234 | 0.4% |
| mexicana | 225 | 0.3% |
| Other values (6974) | 61083 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 81071 | |
| i | 68606 | |
| s | 43652 | 7.3% |
| e | 41408 | 7.0% |
| l | 40211 | 6.8% |
| n | 37437 | 6.3% |
| r | 36574 | 6.1% |
| u | 35997 | 6.0% |
| o | 33783 | 5.7% |
| t | 30496 | 5.1% |
| Other values (18) | 146382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 595493 | |
| Dash Punctuation | 124 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 81071 | |
| i | 68606 | |
| s | 43652 | 7.3% |
| e | 41408 | 7.0% |
| l | 40211 | 6.8% |
| n | 37437 | 6.3% |
| r | 36574 | 6.1% |
| u | 35997 | 6.0% |
| o | 33783 | 5.7% |
| t | 30496 | 5.1% |
| Other values (17) | 146258 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 124 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 595493 | |
| Common | 124 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 81071 | |
| i | 68606 | |
| s | 43652 | 7.3% |
| e | 41408 | 7.0% |
| l | 40211 | 6.8% |
| n | 37437 | 6.3% |
| r | 36574 | 6.1% |
| u | 35997 | 6.0% |
| o | 33783 | 5.7% |
| t | 30496 | 5.1% |
| Other values (17) | 146258 |
Common
| Value | Count | Frequency (%) |
| - | 124 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 595616 | |
| None | 1 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 81071 | |
| i | 68606 | |
| s | 43652 | 7.3% |
| e | 41408 | 7.0% |
| l | 40211 | 6.8% |
| n | 37437 | 6.3% |
| r | 36574 | 6.1% |
| u | 35997 | 6.0% |
| o | 33783 | 5.7% |
| t | 30496 | 5.1% |
| Other values (17) | 146381 |
None
| Value | Count | Frequency (%) |
| ë | 1 |
cultivarEpithet
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 5 |
| Min length | 5 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | false |
|---|
| Value | Count | Frequency (%) |
| false | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 5 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
taxonRank
Text
| Distinct | 11 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 7 |
| Mean length | 6.92467928 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SPECIES |
|---|---|
| 2nd row | SPECIES |
| 3rd row | SPECIES |
| 4th row | SPECIES |
| 5th row | SPECIES |
| Value | Count | Frequency (%) |
| species | 848247 | |
| genus | 60084 | 6.1% |
| variety | 42962 | 4.3% |
| subspecies | 20363 | 2.1% |
| family | 5330 | 0.5% |
| kingdom | 4747 | 0.5% |
| phylum | 4695 | 0.5% |
| form | 1401 | 0.1% |
| class | 501 | 0.1% |
| order | 69 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1840335 | |
| S | 1818669 | |
| I | 921649 | |
| P | 873305 | |
| C | 869111 | |
| U | 85142 | 1.2% |
| G | 64831 | 0.9% |
| N | 64831 | 0.9% |
| Y | 52987 | 0.8% |
| A | 48793 | 0.7% |
| Other values (17) | 204700 | 3.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 6844346 | |
| Decimal Number | 7 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1840335 | |
| S | 1818669 | |
| I | 921649 | |
| P | 873305 | |
| C | 869111 | |
| U | 85142 | 1.2% |
| G | 64831 | 0.9% |
| N | 64831 | 0.9% |
| Y | 52987 | 0.8% |
| A | 48793 | 0.7% |
| Other values (11) | 204693 | 3.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6844346 | |
| Common | 7 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1840335 | |
| S | 1818669 | |
| I | 921649 | |
| P | 873305 | |
| C | 869111 | |
| U | 85142 | 1.2% |
| G | 64831 | 0.9% |
| N | 64831 | 0.9% |
| Y | 52987 | 0.8% |
| A | 48793 | 0.7% |
| Other values (11) | 204693 | 3.0% |
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6844353 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1840335 | |
| S | 1818669 | |
| I | 921649 | |
| P | 873305 | |
| C | 869111 | |
| U | 85142 | 1.2% |
| G | 64831 | 0.9% |
| N | 64831 | 0.9% |
| Y | 52987 | 0.8% |
| A | 48793 | 0.7% |
| Other values (17) | 204700 | 3.0% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 7296210 |
|---|
| Value | Count | Frequency (%) |
| 7296210 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 7 | 1 | |
| 9 | 1 | |
| 6 | 1 | |
| 1 | 1 | |
| 0 | 1 |
vernacularName
Text
Missing 
| Distinct | 2 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988400 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4.5 |
| Mean length | 4.5 |
| Min length | 1 |
Unique
| Unique | 2 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | HOLOTYPE |
| Value | Count | Frequency (%) |
| 6 | 1 | |
| holotype | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| O | 2 | |
| 6 | 1 | |
| H | 1 | |
| L | 1 | |
| T | 1 | |
| Y | 1 | |
| P | 1 | |
| E | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 8 | |
| Decimal Number | 1 | 11.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| O | 2 | |
| H | 1 | |
| L | 1 | |
| T | 1 | |
| Y | 1 | |
| P | 1 | |
| E | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 8 | |
| Common | 1 | 11.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| O | 2 | |
| H | 1 | |
| L | 1 | |
| T | 1 | |
| Y | 1 | |
| P | 1 | |
| E | 1 |
Common
| Value | Count | Frequency (%) |
| 6 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| O | 2 | |
| 6 | 1 | |
| H | 1 | |
| L | 1 | |
| T | 1 | |
| Y | 1 | |
| P | 1 | |
| E | 1 |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 7 |
|---|---|
| Median length | 7 |
| Mean length | 7 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 7707728 |
|---|
| Value | Count | Frequency (%) |
| 7707728 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 4 | |
| 0 | 1 | 14.3% |
| 2 | 1 | 14.3% |
| 8 | 1 | 14.3% |
taxonomicStatus
Text
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3368 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 8 |
| Mean length | 7.802124597 |
| Min length | 3 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | SYNONYM |
|---|---|
| 2nd row | ACCEPTED |
| 3rd row | SYNONYM |
| 4th row | ACCEPTED |
| 5th row | ACCEPTED |
| Value | Count | Frequency (%) |
| accepted | 779014 | |
| synonym | 194909 | 19.8% |
| doubtful | 11110 | 1.1% |
| 220 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 1558028 | |
| C | 1558028 | |
| T | 790124 | |
| D | 790124 | |
| A | 779014 | |
| P | 779014 | |
| Y | 389818 | 5.1% |
| N | 389818 | 5.1% |
| O | 206019 | 2.7% |
| S | 194909 | 2.5% |
| Other values (7) | 250462 | 3.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 7685355 | |
| Decimal Number | 3 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1558028 | |
| C | 1558028 | |
| T | 790124 | |
| D | 790124 | |
| A | 779014 | |
| P | 779014 | |
| Y | 389818 | 5.1% |
| N | 389818 | 5.1% |
| O | 206019 | 2.7% |
| S | 194909 | 2.5% |
| Other values (5) | 250459 | 3.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7685355 | |
| Common | 3 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 1558028 | |
| C | 1558028 | |
| T | 790124 | |
| D | 790124 | |
| A | 779014 | |
| P | 779014 | |
| Y | 389818 | 5.1% |
| N | 389818 | 5.1% |
| O | 206019 | 2.7% |
| S | 194909 | 2.5% |
| Other values (5) | 250459 | 3.3% |
Common
| Value | Count | Frequency (%) |
| 2 | 2 | |
| 0 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7685358 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 1558028 | |
| C | 1558028 | |
| T | 790124 | |
| D | 790124 | |
| A | 779014 | |
| P | 779014 | |
| Y | 389818 | 5.1% |
| N | 389818 | 5.1% |
| O | 206019 | 2.7% |
| S | 194909 | 2.5% |
| Other values (7) | 250462 | 3.3% |
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 1414 |
|---|
| Value | Count | Frequency (%) |
| 1414 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 2 | |
| 4 | 2 |
taxonRemarks
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 4 |
|---|---|
| Median length | 4 |
| Mean length | 4 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | 6631 |
|---|
| Value | Count | Frequency (%) |
| 6631 | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 3 | 1 | |
| 1 | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 3 | 1 | |
| 1 | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 3 | 1 | |
| 1 | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 2 | |
| 3 | 1 | |
| 1 | 1 |
datasetKey
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 36 |
| Mean length | 35.99997066 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
|---|---|
| 2nd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 3rd row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 4th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| 5th row | 821cc27a-e3bb-4bc5-ac34-89ada245069d |
| Value | Count | Frequency (%) |
| 821cc27a-e3bb-4bc5-ac34-89ada245069d | 988399 | |
| 7296208 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| c | 3953596 | |
| a | 3953596 | |
| - | 3953596 | |
| 2 | 2965199 | |
| b | 2965197 | |
| 4 | 2965197 | |
| 8 | 1976799 | 5.6% |
| 9 | 1976799 | 5.6% |
| 3 | 1976798 | 5.6% |
| 5 | 1976798 | 5.6% |
| Other values (6) | 6918796 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 17791189 | |
| Lowercase Letter | 13837586 | |
| Dash Punctuation | 3953596 | 11.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 2965199 | |
| 4 | 2965197 | |
| 8 | 1976799 | |
| 9 | 1976799 | |
| 3 | 1976798 | |
| 5 | 1976798 | |
| 7 | 988400 | 5.6% |
| 0 | 988400 | 5.6% |
| 6 | 988400 | 5.6% |
| 1 | 988399 | 5.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 3953596 | |
| a | 3953596 | |
| b | 2965197 | |
| d | 1976798 | |
| e | 988399 | 7.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3953596 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21744785 | |
| Latin | 13837586 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| - | 3953596 | |
| 2 | 2965199 | |
| 4 | 2965197 | |
| 8 | 1976799 | |
| 9 | 1976799 | |
| 3 | 1976798 | |
| 5 | 1976798 | |
| 7 | 988400 | 4.5% |
| 0 | 988400 | 4.5% |
| 6 | 988400 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| c | 3953596 | |
| a | 3953596 | |
| b | 2965197 | |
| d | 1976798 | |
| e | 988399 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35582371 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| c | 3953596 | |
| a | 3953596 | |
| - | 3953596 | |
| 2 | 2965199 | |
| b | 2965197 | |
| 4 | 2965197 | |
| 8 | 1976799 | 5.6% |
| 9 | 1976799 | 5.6% |
| 3 | 1976798 | 5.6% |
| 5 | 1976798 | 5.6% |
| Other values (6) | 6918796 |
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | US |
|---|---|
| 2nd row | US |
| 3rd row | US |
| 4th row | US |
| 5th row | US |
| Value | Count | Frequency (%) |
| us | 988399 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 988399 | |
| S | 988399 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1976798 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 988399 | |
| S | 988399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1976798 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 988399 | |
| S | 988399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1976798 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 988399 | |
| S | 988399 |
lastInterpreted
Text
| Distinct | 200353 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99574565 |
| Min length | 7 |
Unique
| Unique | 20728 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 2024-12-02T13:59:14.452Z |
|---|---|
| 2nd row | 2024-12-02T13:57:49.629Z |
| 3rd row | 2024-12-02T13:57:49.533Z |
| 4th row | 2024-12-02T13:59:17.370Z |
| 5th row | 2024-12-02T13:59:30.710Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:56:52.667z | 24 | < 0.1% |
| 2024-12-02t13:57:28.323z | 24 | < 0.1% |
| 2024-12-02t13:57:53.831z | 24 | < 0.1% |
| 2024-12-02t13:57:53.200z | 23 | < 0.1% |
| 2024-12-02t13:57:24.579z | 23 | < 0.1% |
| 2024-12-02t13:57:45.844z | 23 | < 0.1% |
| 2024-12-02t13:57:43.276z | 23 | < 0.1% |
| 2024-12-02t13:57:45.207z | 23 | < 0.1% |
| 2024-12-02t13:57:50.630z | 22 | < 0.1% |
| 2024-12-02t13:57:52.903z | 22 | < 0.1% |
| Other values (200343) | 988169 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4511298 | |
| 0 | 2508199 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 6.7% |
| 5 | 1570391 | 6.6% |
| 3 | 1563965 | 6.6% |
| T | 988399 | 4.2% |
| Z | 988399 | 4.2% |
| Other values (5) | 3549621 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16799649 | |
| Other Punctuation | 2964150 | 12.5% |
| Dash Punctuation | 1976798 | 8.3% |
| Uppercase Letter | 1976798 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4511298 | |
| 0 | 2508199 | |
| 1 | 2493151 | |
| 4 | 1590376 | 9.5% |
| 5 | 1570391 | 9.3% |
| 3 | 1563965 | 9.3% |
| 7 | 759695 | 4.5% |
| 9 | 633974 | 3.8% |
| 6 | 594334 | 3.5% |
| 8 | 574266 | 3.4% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1976798 | |
| . | 987352 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 988399 | |
| Z | 988399 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1976798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21740597 | |
| Latin | 1976798 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4511298 | |
| 0 | 2508199 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 7.3% |
| 5 | 1570391 | 7.2% |
| 3 | 1563965 | 7.2% |
| . | 987352 | 4.5% |
| 7 | 759695 | 3.5% |
| Other values (3) | 1802574 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 988399 | |
| Z | 988399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23717395 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4511298 | |
| 0 | 2508199 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 6.7% |
| 5 | 1570391 | 6.6% |
| 3 | 1563965 | 6.6% |
| T | 988399 | 4.2% |
| Z | 988399 | 4.2% |
| Other values (5) | 3549621 |
elevation
Text
Missing 
| Distinct | 4953 |
|---|---|
| Distinct (%) | 1.4% |
| Missing | 625728 |
| Missing (%) | 63.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 6 |
| Mean length | 5.363784004 |
| Min length | 3 |
Unique
| Unique | 1031 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 2742.0 |
|---|---|
| 2nd row | 750.0 |
| 3rd row | 50.0 |
| 4th row | 225.0 |
| 5th row | 17.0 |
| Value | Count | Frequency (%) |
| 1000.0 | 6075 | 1.7% |
| 100.0 | 5877 | 1.6% |
| 500.0 | 4957 | 1.4% |
| 200.0 | 4795 | 1.3% |
| 300.0 | 4744 | 1.3% |
| 800.0 | 4519 | 1.2% |
| 400.0 | 4320 | 1.2% |
| 1500.0 | 4189 | 1.2% |
| 1200.0 | 4187 | 1.2% |
| 900.0 | 4104 | 1.1% |
| Other values (4927) | 314908 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 709378 | |
| . | 362673 | |
| 1 | 180866 | 9.3% |
| 5 | 150898 | 7.8% |
| 2 | 145612 | 7.5% |
| 3 | 90804 | 4.7% |
| 4 | 69684 | 3.6% |
| 7 | 63645 | 3.3% |
| 6 | 61615 | 3.2% |
| 8 | 58985 | 3.0% |
| Other values (15) | 51145 | 2.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 1582558 | |
| Other Punctuation | 362673 | 18.6% |
| Dash Punctuation | 57 | < 0.1% |
| Lowercase Letter | 15 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
| Space Separator | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| a | 2 | |
| p | 1 | |
| f | 1 | |
| b | 1 | |
| u | 1 | |
| e | 1 | |
| n | 1 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 709378 | |
| 1 | 180866 | 11.4% |
| 5 | 150898 | 9.5% |
| 2 | 145612 | 9.2% |
| 3 | 90804 | 5.7% |
| 4 | 69684 | 4.4% |
| 7 | 63645 | 4.0% |
| 6 | 61615 | 3.9% |
| 8 | 58985 | 3.7% |
| 9 | 51071 | 3.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 362673 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 57 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 1945289 | |
| Latin | 16 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 709378 | |
| . | 362673 | |
| 1 | 180866 | 9.3% |
| 5 | 150898 | 7.8% |
| 2 | 145612 | 7.5% |
| 3 | 90804 | 4.7% |
| 4 | 69684 | 3.6% |
| 7 | 63645 | 3.3% |
| 6 | 61615 | 3.2% |
| 8 | 58985 | 3.0% |
| Other values (3) | 51129 | 2.6% |
Latin
| Value | Count | Frequency (%) |
| i | 2 | |
| o | 2 | |
| r | 2 | |
| a | 2 | |
| p | 1 | |
| f | 1 | |
| b | 1 | |
| u | 1 | |
| R | 1 | |
| e | 1 | |
| Other values (2) | 2 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1945305 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 709378 | |
| . | 362673 | |
| 1 | 180866 | 9.3% |
| 5 | 150898 | 7.8% |
| 2 | 145612 | 7.5% |
| 3 | 90804 | 4.7% |
| 4 | 69684 | 3.6% |
| 7 | 63645 | 3.3% |
| 6 | 61615 | 3.2% |
| 8 | 58985 | 3.0% |
| Other values (15) | 51145 | 2.6% |
Missing 
| Distinct | 858 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 880635 |
| Missing (%) | 89.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 19 |
| Mean length | 4.058227472 |
| Min length | 3 |
Unique
| Unique | 278 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | 225.0 |
|---|---|
| 2nd row | 100.0 |
| 3rd row | 0.0 |
| 4th row | 0.0 |
| 5th row | 259.0 |
| Value | Count | Frequency (%) |
| 0.0 | 25899 | |
| 50.0 | 12761 | 11.8% |
| 100.0 | 8238 | 7.6% |
| 150.0 | 5589 | 5.2% |
| 25.0 | 5263 | 4.9% |
| 75.0 | 3266 | 3.0% |
| 200.0 | 3102 | 2.9% |
| 152.5 | 2249 | 2.1% |
| 10.0 | 1930 | 1.8% |
| 250.0 | 1871 | 1.7% |
| Other values (850) | 37602 |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 178346 | |
| . | 107766 | |
| 5 | 60739 | 13.9% |
| 1 | 31294 | 7.2% |
| 2 | 23940 | 5.5% |
| 7 | 10221 | 2.3% |
| 3 | 9561 | 2.2% |
| 4 | 5552 | 1.3% |
| 6 | 4732 | 1.1% |
| 8 | 2655 | 0.6% |
| Other values (15) | 2537 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 329546 | |
| Other Punctuation | 107766 | 24.6% |
| Lowercase Letter | 27 | < 0.1% |
| Space Separator | 3 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4 | |
| a | 4 | |
| i | 3 | |
| o | 3 | |
| p | 2 | |
| u | 2 | |
| b | 2 | |
| f | 2 | |
| l | 2 | |
| e | 1 | 3.7% |
| Other values (2) | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 178346 | |
| 5 | 60739 | 18.4% |
| 1 | 31294 | 9.5% |
| 2 | 23940 | 7.3% |
| 7 | 10221 | 3.1% |
| 3 | 9561 | 2.9% |
| 4 | 5552 | 1.7% |
| 6 | 4732 | 1.4% |
| 8 | 2655 | 0.8% |
| 9 | 2506 | 0.8% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 107766 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 437315 | |
| Latin | 28 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4 | |
| a | 4 | |
| i | 3 | |
| o | 3 | |
| p | 2 | |
| u | 2 | |
| b | 2 | |
| f | 2 | |
| l | 2 | |
| e | 1 | 3.6% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| 0 | 178346 | |
| . | 107766 | |
| 5 | 60739 | 13.9% |
| 1 | 31294 | 7.2% |
| 2 | 23940 | 5.5% |
| 7 | 10221 | 2.3% |
| 3 | 9561 | 2.2% |
| 4 | 5552 | 1.3% |
| 6 | 4732 | 1.1% |
| 8 | 2655 | 0.6% |
| Other values (2) | 2509 | 0.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 437343 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 178346 | |
| . | 107766 | |
| 5 | 60739 | 13.9% |
| 1 | 31294 | 7.2% |
| 2 | 23940 | 5.5% |
| 7 | 10221 | 2.3% |
| 3 | 9561 | 2.2% |
| 4 | 5552 | 1.3% |
| 6 | 4732 | 1.1% |
| 8 | 2655 | 0.6% |
| Other values (15) | 2537 | 0.6% |
depth
Text
Missing 
| Distinct | 138 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 979722 |
| Missing (%) | 99.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 4 |
| Mean length | 3.671198157 |
| Min length | 3 |
Unique
| Unique | 44 ? |
|---|---|
| Unique (%) | 0.5% |
Sample
| 1st row | 3.0 |
|---|---|
| 2nd row | 24.0 |
| 3rd row | 3.0 |
| 4th row | 12.0 |
| 5th row | 6.0 |
| Value | Count | Frequency (%) |
| 12.0 | 1243 | |
| 18.0 | 1164 | |
| 6.0 | 1076 | |
| 24.0 | 937 | |
| 3.0 | 492 | 5.7% |
| 43.0 | 404 | 4.7% |
| 32.0 | 402 | 4.6% |
| 1.5 | 287 | 3.3% |
| 10.0 | 209 | 2.4% |
| 13.0 | 171 | 2.0% |
| Other values (130) | 2298 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 8680 | |
| 0 | 7901 | |
| 1 | 3977 | |
| 2 | 2988 | 9.4% |
| 4 | 1703 | 5.3% |
| 3 | 1695 | 5.3% |
| 5 | 1653 | 5.2% |
| 6 | 1421 | 4.5% |
| 8 | 1313 | 4.1% |
| 7 | 341 | 1.1% |
| Other values (15) | 194 | 0.6% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 23155 | |
| Other Punctuation | 8680 | 27.2% |
| Lowercase Letter | 27 | 0.1% |
| Space Separator | 3 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| r | 4 | |
| a | 4 | |
| i | 3 | |
| o | 3 | |
| p | 2 | |
| u | 2 | |
| b | 2 | |
| f | 2 | |
| l | 2 | |
| e | 1 | 3.7% |
| Other values (2) | 2 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7901 | |
| 1 | 3977 | |
| 2 | 2988 | 12.9% |
| 4 | 1703 | 7.4% |
| 3 | 1695 | 7.3% |
| 5 | 1653 | 7.1% |
| 6 | 1421 | 6.1% |
| 8 | 1313 | 5.7% |
| 7 | 341 | 1.5% |
| 9 | 163 | 0.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 8680 |
Space Separator
| Value | Count | Frequency (%) |
| 3 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 31838 | |
| Latin | 28 | 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| r | 4 | |
| a | 4 | |
| i | 3 | |
| o | 3 | |
| p | 2 | |
| u | 2 | |
| b | 2 | |
| f | 2 | |
| l | 2 | |
| e | 1 | 3.6% |
| Other values (3) | 3 |
Common
| Value | Count | Frequency (%) |
| . | 8680 | |
| 0 | 7901 | |
| 1 | 3977 | |
| 2 | 2988 | 9.4% |
| 4 | 1703 | 5.3% |
| 3 | 1695 | 5.3% |
| 5 | 1653 | 5.2% |
| 6 | 1421 | 4.5% |
| 8 | 1313 | 4.1% |
| 7 | 341 | 1.1% |
| Other values (2) | 166 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31866 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 8680 | |
| 0 | 7901 | |
| 1 | 3977 | |
| 2 | 2988 | 9.4% |
| 4 | 1703 | 5.3% |
| 3 | 1695 | 5.3% |
| 5 | 1653 | 5.2% |
| 6 | 1421 | 4.5% |
| 8 | 1313 | 4.1% |
| 7 | 341 | 1.1% |
| Other values (15) | 194 | 0.6% |
depthAccuracy
Text
Missing 
| Distinct | 38 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 980482 |
| Missing (%) | 99.2% |
| Memory size | 7.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 3 |
| Mean length | 3.033964646 |
| Min length | 3 |
Unique
| Unique | 13 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 0.0 |
|---|---|
| 2nd row | 3.0 |
| 3rd row | 0.0 |
| 4th row | 3.0 |
| 5th row | 3.0 |
| Value | Count | Frequency (%) |
| 3.0 | 4303 | |
| 1.0 | 646 | 8.2% |
| 1.5 | 562 | 7.1% |
| 6.0 | 519 | 6.6% |
| 0.0 | 430 | 5.4% |
| 5.0 | 409 | 5.2% |
| 2.5 | 208 | 2.6% |
| 2.0 | 165 | 2.1% |
| 4.5 | 141 | 1.8% |
| 0.5 | 119 | 1.5% |
| Other values (28) | 418 | 5.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 7920 | |
| 0 | 7391 | |
| 3 | 4349 | |
| 5 | 1773 | 7.4% |
| 1 | 1300 | 5.4% |
| 6 | 546 | 2.3% |
| 2 | 399 | 1.7% |
| 4 | 205 | 0.9% |
| 7 | 130 | 0.5% |
| 8 | 9 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16109 | |
| Other Punctuation | 7920 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 7391 | |
| 3 | 4349 | |
| 5 | 1773 | 11.0% |
| 1 | 1300 | 8.1% |
| 6 | 546 | 3.4% |
| 2 | 399 | 2.5% |
| 4 | 205 | 1.3% |
| 7 | 130 | 0.8% |
| 8 | 9 | 0.1% |
| 9 | 7 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 7920 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 24029 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| . | 7920 | |
| 0 | 7391 | |
| 3 | 4349 | |
| 5 | 1773 | 7.4% |
| 1 | 1300 | 5.4% |
| 6 | 546 | 2.3% |
| 2 | 399 | 1.7% |
| 4 | 205 | 0.9% |
| 7 | 130 | 0.5% |
| 8 | 9 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24029 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 7920 | |
| 0 | 7391 | |
| 3 | 4349 | |
| 5 | 1773 | 7.4% |
| 1 | 1300 | 5.4% |
| 6 | 546 | 2.3% |
| 2 | 399 | 1.7% |
| 4 | 205 | 0.9% |
| 7 | 130 | 0.5% |
| 8 | 9 | < 0.1% |
distanceFromCentroidInMeters
Text
Missing 
| Distinct | 268 |
|---|---|
| Distinct (%) | 45.0% |
| Missing | 987807 |
| Missing (%) | 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 18 |
|---|---|
| Median length | 17 |
| Mean length | 16.98151261 |
| Min length | 3 |
Unique
| Unique | 175 ? |
|---|---|
| Unique (%) | 29.4% |
Sample
| 1st row | 4008.084105458271 |
|---|---|
| 2nd row | 3618.177880660989 |
| 3rd row | 3836.5095124475733 |
| 4th row | 4578.201466648226 |
| 5th row | 4726.696371513394 |
| Value | Count | Frequency (%) |
| 2015.7207067821585 | 45 | 7.6% |
| 3318.235939960053 | 28 | 4.7% |
| 3731.647014894624 | 17 | 2.9% |
| 0.0 | 16 | 2.7% |
| 2241.7609420453923 | 15 | 2.5% |
| 365.55388600261153 | 11 | 1.8% |
| 4225.163801327021 | 11 | 1.8% |
| 4008.084105458271 | 10 | 1.7% |
| 4819.432257301775 | 10 | 1.7% |
| 4954.407240854524 | 8 | 1.3% |
| Other values (258) | 424 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1043 | |
| 4 | 1043 | |
| 5 | 1013 | |
| 3 | 1003 | |
| 0 | 1000 | |
| 1 | 943 | |
| 8 | 927 | |
| 7 | 918 | |
| 6 | 832 | |
| 9 | 785 | |
| Other values (4) | 597 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 9507 | |
| Other Punctuation | 594 | 5.9% |
| Uppercase Letter | 3 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1043 | |
| 4 | 1043 | |
| 5 | 1013 | |
| 3 | 1003 | |
| 0 | 1000 | |
| 1 | 943 | |
| 8 | 927 | |
| 7 | 918 | |
| 6 | 832 | |
| 9 | 785 |
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 594 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 10101 | |
| Latin | 3 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1043 | |
| 4 | 1043 | |
| 5 | 1013 | |
| 3 | 1003 | |
| 0 | 1000 | |
| 1 | 943 | |
| 8 | 927 | |
| 7 | 918 | |
| 6 | 832 | |
| 9 | 785 |
Latin
| Value | Count | Frequency (%) |
| E | 1 | |
| M | 1 | |
| L | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10104 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1043 | |
| 4 | 1043 | |
| 5 | 1013 | |
| 3 | 1003 | |
| 0 | 1000 | |
| 1 | 943 | |
| 8 | 927 | |
| 7 | 918 | |
| 6 | 832 | |
| 9 | 785 | |
| Other values (4) | 597 |
issue
Text
| Distinct | 228 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 101 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 207 |
|---|---|
| Median length | 48 |
| Mean length | 55.97726907 |
| Min length | 17 |
Unique
| Unique | 42 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT;GEODETIC_DATUM_ASSUMED_WGS84;CONTINENT_COORDINATE_MISMATCH |
|---|---|
| 2nd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 3rd row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 4th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| 5th row | OCCURRENCE_STATUS_INFERRED_FROM_INDIVIDUAL_COUNT |
| Value | Count | Frequency (%) |
| occurrence_status_inferred_from_individual_count | 746709 | |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84 | 122708 | 12.4% |
| occurrence_status_inferred_from_individual_count;taxon_match_higherrank | 20378 | 2.1% |
| occurrence_status_inferred_from_individual_count;recorded_date_mismatch | 20160 | 2.0% |
| occurrence_status_inferred_from_individual_count;continent_derived_from_country;continent_invalid | 18108 | 1.8% |
| occurrence_status_inferred_from_individual_count;continent_country_mismatch | 10576 | 1.1% |
| occurrence_status_inferred_from_individual_count;taxon_match_fuzzy | 10400 | 1.1% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;geodetic_datum_invalid | 4482 | 0.5% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;taxon_match_higherrank | 4056 | 0.4% |
| occurrence_status_inferred_from_individual_count;geodetic_datum_assumed_wgs84;continent_derived_from_coordinates;continent_invalid | 3294 | 0.3% |
| Other values (218) | 27430 | 2.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| _ | 5687389 | |
| R | 5152841 | |
| E | 4657864 | 8.4% |
| I | 4362875 | 7.9% |
| C | 4341521 | 7.8% |
| N | 4323540 | 7.8% |
| U | 4305475 | 7.8% |
| T | 3629406 | 6.6% |
| D | 3603366 | 6.5% |
| O | 3347835 | 6.1% |
| Other values (29) | 11910279 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 49042028 | |
| Connector Punctuation | 5687389 | 10.3% |
| Other Punctuation | 301057 | 0.5% |
| Decimal Number | 291915 | 0.5% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 5152841 | |
| E | 4657864 | |
| I | 4362875 | |
| C | 4341521 | |
| N | 4323540 | |
| U | 4305475 | |
| T | 3629406 | |
| D | 3603366 | |
| O | 3347835 | 6.8% |
| A | 2523853 | 5.1% |
| Other values (14) | 8793452 |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 145950 | |
| 8 | 145949 | |
| 2 | 4 | < 0.1% |
| 0 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| 1 | 2 | < 0.1% |
| 3 | 1 | < 0.1% |
| 5 | 1 | < 0.1% |
| 9 | 1 | < 0.1% |
| 6 | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 301054 | |
| : | 2 | < 0.1% |
| . | 1 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 5687389 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 49042028 | |
| Common | 6280363 | 11.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 5152841 | |
| E | 4657864 | |
| I | 4362875 | |
| C | 4341521 | |
| N | 4323540 | |
| U | 4305475 | |
| T | 3629406 | |
| D | 3603366 | |
| O | 3347835 | 6.8% |
| A | 2523853 | 5.1% |
| Other values (14) | 8793452 |
Common
| Value | Count | Frequency (%) |
| _ | 5687389 | |
| ; | 301054 | 4.8% |
| 4 | 145950 | 2.3% |
| 8 | 145949 | 2.3% |
| 2 | 4 | < 0.1% |
| 0 | 3 | < 0.1% |
| 7 | 3 | < 0.1% |
| : | 2 | < 0.1% |
| - | 2 | < 0.1% |
| 1 | 2 | < 0.1% |
| Other values (5) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 55322391 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| _ | 5687389 | |
| R | 5152841 | |
| E | 4657864 | 8.4% |
| I | 4362875 | 7.9% |
| C | 4341521 | 7.8% |
| N | 4323540 | 7.8% |
| U | 4305475 | 7.8% |
| T | 3629406 | 6.6% |
| D | 3603366 | 6.5% |
| O | 3347835 | 6.1% |
| Other values (29) | 11910279 |
mediaType
Text
Missing 
| Distinct | 46 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 69371 |
| Missing (%) | 7.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 571 |
|---|---|
| Median length | 10 |
| Mean length | 10.82667505 |
| Min length | 10 |
Unique
| Unique | 12 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | StillImage |
|---|---|
| 2nd row | StillImage |
| 3rd row | StillImage |
| 4th row | StillImage |
| 5th row | StillImage |
| Value | Count | Frequency (%) |
| stillimage | 864575 | |
| stillimage;stillimage | 50344 | 5.5% |
| stillimage;stillimage;stillimage | 1419 | 0.2% |
| stillimage;stillimage;stillimage;stillimage | 997 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage | 485 | 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 345 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 233 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 154 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 88 | < 0.1% |
| stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage;stillimage | 61 | < 0.1% |
| Other values (36) | 330 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| l | 1976192 | |
| S | 988096 | |
| i | 988096 | |
| I | 988096 | |
| m | 988096 | |
| a | 988096 | |
| g | 988096 | |
| e | 988096 | |
| t | 988096 | |
| ; | 69066 | 0.7% |
| Other values (12) | 24 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 7904768 | |
| Uppercase Letter | 1976194 | 19.9% |
| Other Punctuation | 69069 | 0.7% |
| Decimal Number | 17 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 1976192 | |
| i | 988096 | |
| m | 988096 | |
| a | 988096 | |
| g | 988096 | |
| e | 988096 | |
| t | 988096 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 5 | |
| 1 | 4 | |
| 4 | 3 | |
| 0 | 2 | 11.8% |
| 8 | 1 | 5.9% |
| 3 | 1 | 5.9% |
| 6 | 1 | 5.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 988096 | |
| I | 988096 | |
| T | 1 | < 0.1% |
| Z | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| ; | 69066 | |
| : | 2 | < 0.1% |
| . | 1 | < 0.1% |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 9880962 | |
| Common | 69088 | 0.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| l | 1976192 | |
| S | 988096 | |
| i | 988096 | |
| I | 988096 | |
| m | 988096 | |
| a | 988096 | |
| g | 988096 | |
| e | 988096 | |
| t | 988096 | |
| T | 1 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| ; | 69066 | |
| 2 | 5 | < 0.1% |
| 1 | 4 | < 0.1% |
| 4 | 3 | < 0.1% |
| : | 2 | < 0.1% |
| 0 | 2 | < 0.1% |
| - | 2 | < 0.1% |
| 8 | 1 | < 0.1% |
| 3 | 1 | < 0.1% |
| . | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 9950050 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| l | 1976192 | |
| S | 988096 | |
| i | 988096 | |
| I | 988096 | |
| m | 988096 | |
| a | 988096 | |
| g | 988096 | |
| e | 988096 | |
| t | 988096 | |
| ; | 69066 | 0.7% |
| Other values (12) | 24 | < 0.1% |
hasCoordinate
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 34 |
|---|---|
| Median length | 5 |
| Mean length | 4.850901608 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | true |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 841002 | |
| true | 147398 | 14.9% |
| rollinia | 1 | < 0.1% |
| edulis | 1 | < 0.1% |
| var | 1 | < 0.1% |
| acuta | 1 | < 0.1% |
| r.e.fr | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 988401 | |
| a | 841006 | |
| l | 841005 | |
| s | 841003 | |
| f | 841002 | |
| r | 147400 | 3.1% |
| u | 147400 | 3.1% |
| t | 147399 | 3.1% |
| . | 4 | < 0.1% |
| 4 | < 0.1% | |
| Other values (9) | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4794624 | |
| Other Punctuation | 4 | < 0.1% |
| Space Separator | 4 | < 0.1% |
| Uppercase Letter | 4 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 988401 | |
| a | 841006 | |
| l | 841005 | |
| s | 841003 | |
| f | 841002 | |
| r | 147400 | 3.1% |
| u | 147400 | 3.1% |
| t | 147399 | 3.1% |
| i | 3 | < 0.1% |
| c | 1 | < 0.1% |
| Other values (4) | 4 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 2 | |
| E | 1 | |
| F | 1 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 4 |
Space Separator
| Value | Count | Frequency (%) |
| 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4794628 | |
| Common | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 988401 | |
| a | 841006 | |
| l | 841005 | |
| s | 841003 | |
| f | 841002 | |
| r | 147400 | 3.1% |
| u | 147400 | 3.1% |
| t | 147399 | 3.1% |
| i | 3 | < 0.1% |
| R | 2 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| . | 4 | |
| 4 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4794636 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 988401 | |
| a | 841006 | |
| l | 841005 | |
| s | 841003 | |
| f | 841002 | |
| r | 147400 | 3.1% |
| u | 147400 | 3.1% |
| t | 147399 | 3.1% |
| . | 4 | < 0.1% |
| 4 | < 0.1% | |
| Other values (9) | 12 | < 0.1% |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.99511331 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 983569 | |
| true | 4830 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 988399 | |
| f | 983569 | |
| a | 983569 | |
| l | 983569 | |
| s | 983569 | |
| t | 4830 | 0.1% |
| r | 4830 | 0.1% |
| u | 4830 | 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4937165 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 988399 | |
| f | 983569 | |
| a | 983569 | |
| l | 983569 | |
| s | 983569 | |
| t | 4830 | 0.1% |
| r | 4830 | 0.1% |
| u | 4830 | 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4937165 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 988399 | |
| f | 983569 | |
| a | 983569 | |
| l | 983569 | |
| s | 983569 | |
| t | 4830 | 0.1% |
| r | 4830 | 0.1% |
| u | 4830 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4937165 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 988399 | |
| f | 983569 | |
| a | 983569 | |
| l | 983569 | |
| s | 983569 | |
| t | 4830 | 0.1% |
| r | 4830 | 0.1% |
| u | 4830 | 0.1% |
taxonKey
Text
| Distinct | 171484 |
|---|---|
| Distinct (%) | 17.3% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 6.964352453 |
| Min length | 1 |
Unique
| Unique | 76155 ? |
|---|---|
| Unique (%) | 7.7% |
Sample
| 1st row | 2654951 |
|---|---|
| 2nd row | 2947270 |
| 3rd row | 2765389 |
| 4th row | 3687053 |
| 5th row | 7355530 |
| Value | Count | Frequency (%) |
| 8176985 | 3995 | 0.4% |
| 0 | 3366 | 0.3% |
| 2655370 | 1333 | 0.1% |
| 6 | 1163 | 0.1% |
| 3219107 | 1082 | 0.1% |
| 5426909 | 1064 | 0.1% |
| 5426949 | 994 | 0.1% |
| 4270616 | 933 | 0.1% |
| 2655497 | 809 | 0.1% |
| 2654437 | 772 | 0.1% |
| Other values (171474) | 972888 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 890366 | |
| 3 | 810909 | |
| 5 | 728630 | |
| 7 | 726369 | |
| 0 | 644591 | |
| 6 | 635380 | |
| 8 | 634801 | |
| 1 | 619707 | |
| 9 | 605275 | |
| 4 | 587531 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6883559 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 890366 | |
| 3 | 810909 | |
| 5 | 728630 | |
| 7 | 726369 | |
| 0 | 644591 | |
| 6 | 635380 | |
| 8 | 634801 | |
| 1 | 619707 | |
| 9 | 605275 | |
| 4 | 587531 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6883559 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 890366 | |
| 3 | 810909 | |
| 5 | 728630 | |
| 7 | 726369 | |
| 0 | 644591 | |
| 6 | 635380 | |
| 8 | 634801 | |
| 1 | 619707 | |
| 9 | 605275 | |
| 4 | 587531 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6883559 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 890366 | |
| 3 | 810909 | |
| 5 | 728630 | |
| 7 | 726369 | |
| 0 | 644591 | |
| 6 | 635380 | |
| 8 | 634801 | |
| 1 | 619707 | |
| 9 | 605275 | |
| 4 | 587531 |
acceptedTaxonKey
Text
| Distinct | 141149 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 3368 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.000687286 |
| Min length | 1 |
Unique
| Unique | 52485 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | 2654944 |
|---|---|
| 2nd row | 2947270 |
| 3rd row | 10416230 |
| 4th row | 3687053 |
| 5th row | 7355530 |
| Value | Count | Frequency (%) |
| 7947184 | 4001 | 0.4% |
| 2655370 | 1415 | 0.1% |
| 6 | 1163 | 0.1% |
| 3219107 | 1082 | 0.1% |
| 5426909 | 1064 | 0.1% |
| 2702678 | 1008 | 0.1% |
| 5426949 | 994 | 0.1% |
| 2654909 | 868 | 0.1% |
| 2655497 | 809 | 0.1% |
| 5426932 | 760 | 0.1% |
| Other values (141139) | 971870 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 | |
| Other values (5) | 5 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6895910 | |
| Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 |
Lowercase Letter
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6895910 | |
| Latin | 5 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 |
Latin
| Value | Count | Frequency (%) |
| f | 1 | |
| a | 1 | |
| l | 1 | |
| s | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6895915 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 898604 | |
| 3 | 797962 | |
| 7 | 733166 | |
| 5 | 715553 | |
| 0 | 648866 | |
| 1 | 638677 | |
| 8 | 635333 | |
| 6 | 626367 | |
| 9 | 613562 | |
| 4 | 587820 | |
| Other values (5) | 5 | < 0.1% |
kingdomKey
Text
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 1 |
| Mean length | 1.000012141 |
| Min length | 1 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 6 |
|---|---|
| 2nd row | 6 |
| 3rd row | 6 |
| 4th row | 6 |
| 5th row | 6 |
| Value | Count | Frequency (%) |
| 6 | 907311 | |
| 5 | 48945 | 5.0% |
| 4 | 17041 | 1.7% |
| 3 | 11701 | 1.2% |
| 0 | 3366 | 0.3% |
| 7 | 31 | < 0.1% |
| 1 | 4 | < 0.1% |
| latin_america | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 907311 | |
| 5 | 48945 | 5.0% |
| 4 | 17041 | 1.7% |
| 3 | 11701 | 1.2% |
| 0 | 3366 | 0.3% |
| 7 | 31 | < 0.1% |
| 1 | 4 | < 0.1% |
| A | 3 | < 0.1% |
| I | 2 | < 0.1% |
| L | 1 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 988399 | |
| Uppercase Letter | 12 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3 | |
| I | 2 | |
| L | 1 | 8.3% |
| T | 1 | 8.3% |
| N | 1 | 8.3% |
| M | 1 | 8.3% |
| E | 1 | 8.3% |
| R | 1 | 8.3% |
| C | 1 | 8.3% |
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 907311 | |
| 5 | 48945 | 5.0% |
| 4 | 17041 | 1.7% |
| 3 | 11701 | 1.2% |
| 0 | 3366 | 0.3% |
| 7 | 31 | < 0.1% |
| 1 | 4 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 988400 | |
| Latin | 12 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3 | |
| I | 2 | |
| L | 1 | 8.3% |
| T | 1 | 8.3% |
| N | 1 | 8.3% |
| M | 1 | 8.3% |
| E | 1 | 8.3% |
| R | 1 | 8.3% |
| C | 1 | 8.3% |
Common
| Value | Count | Frequency (%) |
| 6 | 907311 | |
| 5 | 48945 | 5.0% |
| 4 | 17041 | 1.7% |
| 3 | 11701 | 1.2% |
| 0 | 3366 | 0.3% |
| 7 | 31 | < 0.1% |
| 1 | 4 | < 0.1% |
| _ | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 988412 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 907311 | |
| 5 | 48945 | 5.0% |
| 4 | 17041 | 1.7% |
| 3 | 11701 | 1.2% |
| 0 | 3366 | 0.3% |
| 7 | 31 | < 0.1% |
| 1 | 4 | < 0.1% |
| A | 3 | < 0.1% |
| I | 2 | < 0.1% |
| L | 1 | < 0.1% |
| Other values (7) | 7 | < 0.1% |
phylumKey
Text
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4754 |
| Missing (%) | 0.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 7 |
| Mean length | 6.258306833 |
| Min length | 1 |
Unique
| Unique | 5 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 106 |
|---|---|
| 2nd row | 7707728 |
| 3rd row | 7707728 |
| 4th row | 7707728 |
| 5th row | 7707728 |
| Value | Count | Frequency (%) |
| 7707728 | 830617 | |
| 95 | 48276 | 4.9% |
| 35 | 32695 | 3.3% |
| 106 | 26385 | 2.7% |
| 98 | 15149 | 1.5% |
| 68 | 11694 | 1.2% |
| 36 | 9268 | 0.9% |
| 9 | 5937 | 0.6% |
| 8770992 | 1887 | 0.2% |
| 7819616 | 1126 | 0.1% |
| Other values (14) | 614 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 7 | 3327391 | |
| 8 | 860477 | 14.0% |
| 0 | 858902 | 14.0% |
| 2 | 832505 | 13.5% |
| 5 | 80985 | 1.3% |
| 9 | 74277 | 1.2% |
| 6 | 49606 | 0.8% |
| 3 | 42560 | 0.7% |
| 1 | 28785 | 0.5% |
| 4 | 470 | < 0.1% |
| Other values (11) | 13 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6155958 | |
| Uppercase Letter | 12 | < 0.1% |
| Connector Punctuation | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 3327391 | |
| 8 | 860477 | 14.0% |
| 0 | 858902 | 14.0% |
| 2 | 832505 | 13.5% |
| 5 | 80985 | 1.3% |
| 9 | 74277 | 1.2% |
| 6 | 49606 | 0.8% |
| 3 | 42560 | 0.7% |
| 1 | 28785 | 0.5% |
| 4 | 470 | < 0.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| R | 2 | |
| I | 1 | |
| E | 1 | |
| M | 1 | |
| N | 1 | |
| H | 1 | |
| T | 1 | |
| O | 1 | |
| C | 1 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6155959 | |
| Latin | 12 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 7 | 3327391 | |
| 8 | 860477 | 14.0% |
| 0 | 858902 | 14.0% |
| 2 | 832505 | 13.5% |
| 5 | 80985 | 1.3% |
| 9 | 74277 | 1.2% |
| 6 | 49606 | 0.8% |
| 3 | 42560 | 0.7% |
| 1 | 28785 | 0.5% |
| 4 | 470 | < 0.1% |
Latin
| Value | Count | Frequency (%) |
| A | 2 | |
| R | 2 | |
| I | 1 | |
| E | 1 | |
| M | 1 | |
| N | 1 | |
| H | 1 | |
| T | 1 | |
| O | 1 | |
| C | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6155971 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 7 | 3327391 | |
| 8 | 860477 | 14.0% |
| 0 | 858902 | 14.0% |
| 2 | 832505 | 13.5% |
| 5 | 80985 | 1.3% |
| 9 | 74277 | 1.2% |
| 6 | 49606 | 0.8% |
| 3 | 42560 | 0.7% |
| 1 | 28785 | 0.5% |
| 4 | 470 | < 0.1% |
| Other values (11) | 13 | < 0.1% |
classKey
Text
| Distinct | 68 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 5481 |
| Missing (%) | 0.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 3 |
| Mean length | 3.360726854 |
| Min length | 3 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 342 |
|---|---|
| 2nd row | 220 |
| 3rd row | 196 |
| 4th row | 220 |
| 5th row | 196 |
| Value | Count | Frequency (%) |
| 220 | 565617 | |
| 196 | 199036 | 20.2% |
| 7228684 | 54963 | 5.6% |
| 180 | 44421 | 4.5% |
| 327 | 29396 | 3.0% |
| 342 | 25770 | 2.6% |
| 10774316 | 11282 | 1.1% |
| 7947184 | 8448 | 0.9% |
| 195 | 8422 | 0.9% |
| 7073593 | 6544 | 0.7% |
| Other values (58) | 29022 | 3.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1314764 | |
| 0 | 633807 | |
| 1 | 302985 | 9.2% |
| 6 | 272083 | 8.2% |
| 9 | 235552 | 7.1% |
| 8 | 166999 | 5.1% |
| 7 | 141061 | 4.3% |
| 4 | 123266 | 3.7% |
| 3 | 88216 | 2.7% |
| 5 | 24596 | 0.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3303329 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1314764 | |
| 0 | 633807 | |
| 1 | 302985 | 9.2% |
| 6 | 272083 | 8.2% |
| 9 | 235552 | 7.1% |
| 8 | 166999 | 5.1% |
| 7 | 141061 | 4.3% |
| 4 | 123266 | 3.7% |
| 3 | 88216 | 2.7% |
| 5 | 24596 | 0.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3303329 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1314764 | |
| 0 | 633807 | |
| 1 | 302985 | 9.2% |
| 6 | 272083 | 8.2% |
| 9 | 235552 | 7.1% |
| 8 | 166999 | 5.1% |
| 7 | 141061 | 4.3% |
| 4 | 123266 | 3.7% |
| 3 | 88216 | 2.7% |
| 5 | 24596 | 0.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3303329 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1314764 | |
| 0 | 633807 | |
| 1 | 302985 | 9.2% |
| 6 | 272083 | 8.2% |
| 9 | 235552 | 7.1% |
| 8 | 166999 | 5.1% |
| 7 | 141061 | 4.3% |
| 4 | 123266 | 3.7% |
| 3 | 88216 | 2.7% |
| 5 | 24596 | 0.7% |
orderKey
Text
Missing 
| Distinct | 358 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10134 |
| Missing (%) | 1.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 68 |
|---|---|
| Median length | 8 |
| Mean length | 3.762241022 |
| Min length | 3 |
Unique
| Unique | 39 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 670 |
|---|---|
| 2nd row | 1370 |
| 3rd row | 553 |
| 4th row | 7224021 |
| 5th row | 1369 |
| Value | Count | Frequency (%) |
| 1369 | 153750 | 15.7% |
| 414 | 83320 | 8.5% |
| 408 | 58318 | 6.0% |
| 1370 | 55218 | 5.6% |
| 1414 | 46323 | 4.7% |
| 392 | 42295 | 4.3% |
| 412 | 39541 | 4.0% |
| 690 | 34933 | 3.6% |
| 422 | 32482 | 3.3% |
| 691 | 28326 | 2.9% |
| Other values (353) | 403767 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 767886 | |
| 4 | 547702 | |
| 3 | 458030 | |
| 9 | 405888 | |
| 6 | 375958 | |
| 2 | 317418 | |
| 0 | 298525 | 8.1% |
| 7 | 202999 | 5.5% |
| 5 | 169746 | 4.6% |
| 8 | 136260 | 3.7% |
| Other values (21) | 68 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 3680412 | |
| Lowercase Letter | 52 | < 0.1% |
| Space Separator | 5 | < 0.1% |
| Uppercase Letter | 5 | < 0.1% |
| Other Punctuation | 4 | < 0.1% |
| Open Punctuation | 1 | < 0.1% |
| Close Punctuation | 1 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.8% |
| s | 2 | 3.8% |
| d | 2 | 3.8% |
| t | 2 | 3.8% |
| c | 2 | 3.8% |
| Other values (3) | 3 | 5.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 767886 | |
| 4 | 547702 | |
| 3 | 458030 | |
| 9 | 405888 | |
| 6 | 375958 | |
| 2 | 317418 | |
| 0 | 298525 | 8.1% |
| 7 | 202999 | 5.5% |
| 5 | 169746 | 4.6% |
| 8 | 136260 | 3.7% |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2 | |
| D | 1 | |
| P | 1 | |
| M | 1 |
Space Separator
| Value | Count | Frequency (%) |
| 5 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 3680423 | |
| Latin | 57 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 10 | |
| n | 9 | |
| e | 8 | |
| o | 6 | |
| l | 5 | |
| i | 3 | 5.3% |
| s | 2 | 3.5% |
| d | 2 | 3.5% |
| t | 2 | 3.5% |
| A | 2 | 3.5% |
| Other values (7) | 8 |
Common
| Value | Count | Frequency (%) |
| 1 | 767886 | |
| 4 | 547702 | |
| 3 | 458030 | |
| 9 | 405888 | |
| 6 | 375958 | |
| 2 | 317418 | |
| 0 | 298525 | 8.1% |
| 7 | 202999 | 5.5% |
| 5 | 169746 | 4.6% |
| 8 | 136260 | 3.7% |
| Other values (4) | 11 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3680480 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 767886 | |
| 4 | 547702 | |
| 3 | 458030 | |
| 9 | 405888 | |
| 6 | 375958 | |
| 2 | 317418 | |
| 0 | 298525 | 8.1% |
| 7 | 202999 | 5.5% |
| 5 | 169746 | 4.6% |
| 8 | 136260 | 3.7% |
| Other values (21) | 68 | < 0.1% |
familyKey
Text
Missing 
| Distinct | 1293 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 10432 |
| Missing (%) | 1.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 4 |
| Mean length | 4.192720636 |
| Min length | 4 |
Unique
| Unique | 122 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 4376199 |
|---|---|
| 2nd row | 5386 |
| 3rd row | 2763195 |
| 4th row | 6669 |
| 5th row | 3073 |
| Value | Count | Frequency (%) |
| 3073 | 110118 | 11.3% |
| 3065 | 78427 | 8.0% |
| 5386 | 51638 | 5.3% |
| 7708 | 30498 | 3.1% |
| 8798 | 26201 | 2.7% |
| 6683 | 16271 | 1.7% |
| 6685 | 14761 | 1.5% |
| 5015 | 14530 | 1.5% |
| 8305 | 14370 | 1.5% |
| 2497 | 13720 | 1.4% |
| Other values (1283) | 607436 |
Most occurring characters
| Value | Count | Frequency (%) |
| 6 | 656925 | |
| 3 | 648759 | |
| 7 | 486759 | |
| 8 | 420571 | |
| 0 | 419501 | |
| 5 | 351922 | |
| 2 | 334385 | |
| 4 | 300442 | |
| 9 | 244987 | 6.0% |
| 1 | 236097 | 5.8% |
| Other values (6) | 7 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 4100348 | |
| Lowercase Letter | 6 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 6 | 656925 | |
| 3 | 648759 | |
| 7 | 486759 | |
| 8 | 420571 | |
| 0 | 419501 | |
| 5 | 351922 | |
| 2 | 334385 | |
| 4 | 300442 | |
| 9 | 244987 | 6.0% |
| 1 | 236097 | 5.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| P | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 4100348 | |
| Latin | 7 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 6 | 656925 | |
| 3 | 648759 | |
| 7 | 486759 | |
| 8 | 420571 | |
| 0 | 419501 | |
| 5 | 351922 | |
| 2 | 334385 | |
| 4 | 300442 | |
| 9 | 244987 | 6.0% |
| 1 | 236097 | 5.8% |
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| P | 1 | |
| l | 1 | |
| n | 1 | |
| t | 1 | |
| e | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4100355 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 6 | 656925 | |
| 3 | 648759 | |
| 7 | 486759 | |
| 8 | 420571 | |
| 0 | 419501 | |
| 5 | 351922 | |
| 2 | 334385 | |
| 4 | 300442 | |
| 9 | 244987 | 6.0% |
| 1 | 236097 | 5.8% |
| Other values (6) | 7 | < 0.1% |
genusKey
Text
Missing 
| Distinct | 14325 |
|---|---|
| Distinct (%) | 1.5% |
| Missing | 15344 |
| Missing (%) | 1.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 7 |
| Mean length | 7.02022901 |
| Min length | 7 |
Unique
| Unique | 2132 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | 2654889 |
|---|---|
| 2nd row | 2947262 |
| 3rd row | 2763692 |
| 4th row | 3032531 |
| 5th row | 7822478 |
| Value | Count | Frequency (%) |
| 2721893 | 12742 | 1.3% |
| 3188558 | 8772 | 0.9% |
| 2607519 | 6873 | 0.7% |
| 2704173 | 6684 | 0.7% |
| 2713455 | 6044 | 0.6% |
| 2705540 | 5820 | 0.6% |
| 2928997 | 5538 | 0.6% |
| 2705322 | 5205 | 0.5% |
| 2702537 | 4464 | 0.5% |
| 2650583 | 4297 | 0.4% |
| Other values (14315) | 906619 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 1008029 | |
| 3 | 842317 | |
| 7 | 719389 | |
| 8 | 652757 | |
| 0 | 646719 | |
| 9 | 642294 | |
| 1 | 638239 | |
| 6 | 608888 | |
| 5 | 588567 | |
| 4 | 483879 | |
| Other values (10) | 12 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6831078 | |
| Lowercase Letter | 11 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1008029 | |
| 3 | 842317 | |
| 7 | 719389 | |
| 8 | 652757 | |
| 0 | 646719 | |
| 9 | 642294 | |
| 1 | 638239 | |
| 6 | 608888 | |
| 5 | 588567 | |
| 4 | 483879 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| o | 1 | |
| y | 1 | |
| p | 1 | |
| e | 1 | |
| c | 1 | |
| r | 1 | |
| t | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6831078 | |
| Latin | 12 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 1008029 | |
| 3 | 842317 | |
| 7 | 719389 | |
| 8 | 652757 | |
| 0 | 646719 | |
| 9 | 642294 | |
| 1 | 638239 | |
| 6 | 608888 | |
| 5 | 588567 | |
| 4 | 483879 |
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| h | 2 | |
| o | 1 | |
| y | 1 | |
| p | 1 | |
| T | 1 | |
| e | 1 | |
| c | 1 | |
| r | 1 | |
| t | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6831090 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 1008029 | |
| 3 | 842317 | |
| 7 | 719389 | |
| 8 | 652757 | |
| 0 | 646719 | |
| 9 | 642294 | |
| 1 | 638239 | |
| 6 | 608888 | |
| 5 | 588567 | |
| 4 | 483879 | |
| Other values (10) | 12 | < 0.1% |
subgenusKey
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 13 |
| Min length | 13 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | Magnoliopsida |
|---|
| Value | Count | Frequency (%) |
| magnoliopsida | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12 | |
| Uppercase Letter | 1 | 7.7% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2 | |
| o | 2 | |
| i | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| l | 1 | |
| p | 1 | |
| s | 1 | |
| d | 1 |
speciesKey
Text
Missing 
| Distinct | 126812 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 75442 |
| Missing (%) | 7.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 11 |
|---|---|
| Median length | 7 |
| Mean length | 7.027005564 |
| Min length | 7 |
Unique
| Unique | 46507 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | 2654944 |
|---|---|
| 2nd row | 2947270 |
| 3rd row | 10416230 |
| 4th row | 3687053 |
| 5th row | 7355530 |
| Value | Count | Frequency (%) |
| 2655370 | 1415 | 0.2% |
| 3219107 | 1082 | 0.1% |
| 5426909 | 1064 | 0.1% |
| 2702678 | 1008 | 0.1% |
| 5426949 | 994 | 0.1% |
| 2704276 | 943 | 0.1% |
| 2654909 | 868 | 0.1% |
| 2655497 | 809 | 0.1% |
| 5426932 | 760 | 0.1% |
| 8225325 | 689 | 0.1% |
| Other values (126802) | 903328 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 829656 | |
| 3 | 752536 | |
| 5 | 677478 | |
| 7 | 670460 | |
| 0 | 607378 | |
| 8 | 599660 | |
| 1 | 599651 | |
| 9 | 568671 | |
| 6 | 565498 | |
| 4 | 544376 | |
| Other values (9) | 11 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 6415364 | |
| Lowercase Letter | 10 | < 0.1% |
| Uppercase Letter | 1 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 829656 | |
| 3 | 752536 | |
| 5 | 677478 | |
| 7 | 670460 | |
| 0 | 607378 | |
| 8 | 599660 | |
| 1 | 599651 | |
| 9 | 568671 | |
| 6 | 565498 | |
| 4 | 544376 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 6415364 | |
| Latin | 11 | < 0.1% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 829656 | |
| 3 | 752536 | |
| 5 | 677478 | |
| 7 | 670460 | |
| 0 | 607378 | |
| 8 | 599660 | |
| 1 | 599651 | |
| 9 | 568671 | |
| 6 | 565498 | |
| 4 | 544376 |
Latin
| Value | Count | Frequency (%) |
| a | 2 | |
| l | 2 | |
| M | 1 | |
| g | 1 | |
| n | 1 | |
| o | 1 | |
| i | 1 | |
| e | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6415375 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 829656 | |
| 3 | 752536 | |
| 5 | 677478 | |
| 7 | 670460 | |
| 0 | 607378 | |
| 8 | 599660 | |
| 1 | 599651 | |
| 9 | 568671 | |
| 6 | 565498 | |
| 4 | 544376 | |
| Other values (9) | 11 | < 0.1% |
species
Text
Missing 
| Distinct | 126534 |
|---|---|
| Distinct (%) | 13.9% |
| Missing | 75443 |
| Missing (%) | 7.6% |
| Memory size | 7.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 32 |
| Mean length | 18.99834275 |
| Min length | 8 |
Unique
| Unique | 46337 ? |
|---|---|
| Unique (%) | 5.1% |
Sample
| 1st row | Phymatolithon calcareum |
|---|---|
| 2nd row | Amicia glandulosa |
| 3rd row | Callisia glandulosa |
| 4th row | Connarus steyermarkii |
| 5th row | Trichoneura grandiglumis |
| Value | Count | Frequency (%) |
| carex | 12516 | 0.7% |
| miconia | 8270 | 0.5% |
| poa | 6546 | 0.4% |
| cladonia | 6511 | 0.4% |
| cyperus | 5985 | 0.3% |
| paspalum | 5640 | 0.3% |
| solanum | 5444 | 0.3% |
| eragrostis | 5024 | 0.3% |
| dichanthelium | 4451 | 0.2% |
| asplenium | 4181 | 0.2% |
| Other values (53483) | 1761460 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2129654 | 12.3% |
| i | 1716826 | 9.9% |
| e | 1157262 | 6.7% |
| r | 1076152 | 6.2% |
| o | 1063266 | 6.1% |
| s | 1029428 | 5.9% |
| l | 993778 | 5.7% |
| n | 939467 | 5.4% |
| 913069 | 5.3% | |
| u | 892693 | 5.1% |
| Other values (44) | 5433113 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 15514709 | |
| Space Separator | 913069 | 5.3% |
| Uppercase Letter | 912981 | 5.3% |
| Dash Punctuation | 3949 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2129654 | |
| i | 1716826 | |
| e | 1157262 | 7.5% |
| r | 1076152 | 6.9% |
| o | 1063266 | 6.9% |
| s | 1029428 | 6.6% |
| l | 993778 | 6.4% |
| n | 939467 | 6.1% |
| u | 892693 | 5.8% |
| t | 778677 | 5.0% |
| Other values (16) | 3737506 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 127940 | |
| P | 122237 | |
| S | 92668 | |
| A | 81730 | 9.0% |
| M | 60331 | 6.6% |
| E | 52038 | 5.7% |
| L | 45549 | 5.0% |
| D | 43846 | 4.8% |
| B | 37463 | 4.1% |
| H | 37313 | 4.1% |
| Other values (16) | 211866 |
Space Separator
| Value | Count | Frequency (%) |
| 913069 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 3949 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16427690 | |
| Common | 917018 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2129654 | |
| i | 1716826 | 10.5% |
| e | 1157262 | 7.0% |
| r | 1076152 | 6.6% |
| o | 1063266 | 6.5% |
| s | 1029428 | 6.3% |
| l | 993778 | 6.0% |
| n | 939467 | 5.7% |
| u | 892693 | 5.4% |
| t | 778677 | 4.7% |
| Other values (42) | 4650487 |
Common
| Value | Count | Frequency (%) |
| 913069 | ||
| - | 3949 | 0.4% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17344708 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2129654 | 12.3% |
| i | 1716826 | 9.9% |
| e | 1157262 | 6.7% |
| r | 1076152 | 6.2% |
| o | 1063266 | 6.1% |
| s | 1029428 | 5.9% |
| l | 993778 | 5.7% |
| n | 939467 | 5.4% |
| 913069 | 5.3% | |
| u | 892693 | 5.1% |
| Other values (44) | 5433113 |
| Distinct | 141148 |
|---|---|
| Distinct (%) | 14.3% |
| Missing | 3368 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 145 |
|---|---|
| Median length | 98 |
| Mean length | 31.85947389 |
| Min length | 5 |
Unique
| Unique | 52484 ? |
|---|---|
| Unique (%) | 5.3% |
Sample
| 1st row | Phymatolithon calcareum (Pallas) Adey & D.L.McKibbin |
|---|---|
| 2nd row | Amicia glandulosa Kunth |
| 3rd row | Callisia glandulosa (Seub.) Christenh. & Byng |
| 4th row | Connarus steyermarkii Prance |
| 5th row | Trichoneura grandiglumis (Nees) Ekman |
| Value | Count | Frequency (%) |
| l | 161894 | 4.2% |
| 145790 | 3.8% | |
| ex | 72859 | 1.9% |
| var | 29546 | 0.8% |
| subsp | 28283 | 0.7% |
| kunth | 26788 | 0.7% |
| dc | 25626 | 0.7% |
| benth | 22744 | 0.6% |
| a.gray | 22225 | 0.6% |
| sw | 20887 | 0.5% |
| Other values (67669) | 3311496 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2883104 | 9.2% | |
| a | 2801876 | 8.9% |
| i | 2180333 | 6.9% |
| e | 1948892 | 6.2% |
| r | 1712734 | 5.5% |
| o | 1536718 | 4.9% |
| l | 1524818 | 4.9% |
| . | 1445546 | 4.6% |
| n | 1442074 | 4.6% |
| s | 1413219 | 4.5% |
| Other values (114) | 12493351 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 22800431 | |
| Uppercase Letter | 3092925 | 9.9% |
| Space Separator | 2883104 | 9.2% |
| Other Punctuation | 1639202 | 5.2% |
| Open Punctuation | 417352 | 1.3% |
| Close Punctuation | 417352 | 1.3% |
| Decimal Number | 116016 | 0.4% |
| Dash Punctuation | 13551 | < 0.1% |
| Math Symbol | 2707 | < 0.1% |
| Connector Punctuation | 25 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2801876 | |
| i | 2180333 | 9.6% |
| e | 1948892 | 8.5% |
| r | 1712734 | 7.5% |
| o | 1536718 | 6.7% |
| l | 1524818 | 6.7% |
| n | 1442074 | 6.3% |
| s | 1413219 | 6.2% |
| u | 1243776 | 5.5% |
| t | 1180783 | 5.2% |
| Other values (55) | 5815208 |
Uppercase Letter
| Value | Count | Frequency (%) |
| L | 310532 | 10.0% |
| S | 288516 | 9.3% |
| C | 270821 | 8.8% |
| P | 222781 | 7.2% |
| A | 214829 | 6.9% |
| M | 212413 | 6.9% |
| B | 199884 | 6.5% |
| H | 180134 | 5.8% |
| R | 146849 | 4.7% |
| D | 145970 | 4.7% |
| Other values (29) | 900196 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 33535 | |
| 8 | 23677 | |
| 9 | 15053 | |
| 0 | 7402 | 6.4% |
| 3 | 7186 | 6.2% |
| 2 | 7122 | 6.1% |
| 7 | 6802 | 5.9% |
| 4 | 5757 | 5.0% |
| 6 | 5008 | 4.3% |
| 5 | 4474 | 3.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 1445546 | |
| & | 145790 | 8.9% |
| , | 46048 | 2.8% |
| ' | 1818 | 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 2883104 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 417352 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 417352 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 13551 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 2707 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 25 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25893356 | |
| Common | 5489309 | 17.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2801876 | 10.8% |
| i | 2180333 | 8.4% |
| e | 1948892 | 7.5% |
| r | 1712734 | 6.6% |
| o | 1536718 | 5.9% |
| l | 1524818 | 5.9% |
| n | 1442074 | 5.6% |
| s | 1413219 | 5.5% |
| u | 1243776 | 4.8% |
| t | 1180783 | 4.6% |
| Other values (94) | 8908133 |
Common
| Value | Count | Frequency (%) |
| 2883104 | ||
| . | 1445546 | |
| ( | 417352 | 7.6% |
| ) | 417352 | 7.6% |
| & | 145790 | 2.7% |
| , | 46048 | 0.8% |
| 1 | 33535 | 0.6% |
| 8 | 23677 | 0.4% |
| 9 | 15053 | 0.3% |
| - | 13551 | 0.2% |
| Other values (10) | 48301 | 0.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 31326567 | |
| None | 56098 | 0.2% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2883104 | 9.2% | |
| a | 2801876 | 8.9% |
| i | 2180333 | 7.0% |
| e | 1948892 | 6.2% |
| r | 1712734 | 5.5% |
| o | 1536718 | 4.9% |
| l | 1524818 | 4.9% |
| . | 1445546 | 4.6% |
| n | 1442074 | 4.6% |
| s | 1413219 | 4.5% |
| Other values (61) | 12437253 |
None
| Value | Count | Frequency (%) |
| ü | 16171 | |
| é | 10022 | |
| ö | 8139 | |
| ä | 3690 | 6.6% |
| á | 3643 | 6.5% |
| × | 2707 | 4.8% |
| ø | 1855 | 3.3% |
| Á | 1839 | 3.3% |
| ó | 1209 | 2.2% |
| è | 874 | 1.6% |
| Other values (43) | 5949 | 10.6% |
| Distinct | 177770 |
|---|---|
| Distinct (%) | 18.0% |
| Missing | 3017 |
| Missing (%) | 0.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 125 |
|---|---|
| Median length | 94 |
| Mean length | 19.78091812 |
| Min length | 6 |
Unique
| Unique | 81124 ? |
|---|---|
| Unique (%) | 8.2% |
Sample
| 1st row | Lithothamnion calcareum |
|---|---|
| 2nd row | Amicia glandulosa |
| 3rd row | Tripogandra glandulosa |
| 4th row | Connarus steyermarkii |
| 5th row | Trichoneura grandiglumis |
| Value | Count | Frequency (%) |
| sp | 59300 | 2.8% |
| var | 45918 | 2.2% |
| subsp | 23075 | 1.1% |
| carex | 12732 | 0.6% |
| indet | 9106 | 0.4% |
| poa | 6687 | 0.3% |
| cyperus | 6038 | 0.3% |
| cladonia | 5900 | 0.3% |
| paspalum | 5802 | 0.3% |
| miconia | 5464 | 0.3% |
| Other values (64311) | 1952155 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 2343786 | 12.0% |
| i | 1854416 | 9.5% |
| e | 1259869 | 6.5% |
| s | 1219162 | 6.3% |
| r | 1207808 | 6.2% |
| 1146792 | 5.9% | |
| o | 1138492 | 5.8% |
| l | 1071612 | 5.5% |
| n | 1027785 | 5.3% |
| u | 997253 | 5.1% |
| Other values (78) | 6224845 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 17191337 | |
| Space Separator | 1146792 | 5.9% |
| Uppercase Letter | 994154 | 5.1% |
| Other Punctuation | 150624 | 0.8% |
| Dash Punctuation | 4567 | < 0.1% |
| Decimal Number | 1516 | < 0.1% |
| Open Punctuation | 1412 | < 0.1% |
| Close Punctuation | 1412 | < 0.1% |
| Math Symbol | 6 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 2343786 | |
| i | 1854416 | |
| e | 1259869 | 7.3% |
| s | 1219162 | 7.1% |
| r | 1207808 | 7.0% |
| o | 1138492 | 6.6% |
| l | 1071612 | 6.2% |
| n | 1027785 | 6.0% |
| u | 997253 | 5.8% |
| t | 850651 | 4.9% |
| Other values (24) | 4220503 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 141099 | |
| P | 128999 | |
| S | 97726 | |
| A | 88710 | 8.9% |
| M | 63577 | 6.4% |
| L | 53431 | 5.4% |
| E | 52795 | 5.3% |
| D | 46397 | 4.7% |
| B | 42740 | 4.3% |
| H | 40423 | 4.1% |
| Other values (19) | 238257 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 489 | |
| 0 | 361 | |
| 1 | 328 | |
| 5 | 215 | |
| 7 | 31 | 2.0% |
| 3 | 30 | 2.0% |
| 9 | 29 | 1.9% |
| 8 | 17 | 1.1% |
| 6 | 12 | 0.8% |
| 4 | 4 | 0.3% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 147866 | |
| , | 1090 | 0.7% |
| ' | 972 | 0.6% |
| & | 510 | 0.3% |
| ? | 102 | 0.1% |
| " | 42 | < 0.1% |
| / | 40 | < 0.1% |
| # | 2 | < 0.1% |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1408 | |
| [ | 4 | 0.3% |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1408 | |
| ] | 4 | 0.3% |
Space Separator
| Value | Count | Frequency (%) |
| 1146792 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4567 |
Math Symbol
| Value | Count | Frequency (%) |
| × | 6 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18185491 | |
| Common | 1306329 | 6.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 2343786 | |
| i | 1854416 | 10.2% |
| e | 1259869 | 6.9% |
| s | 1219162 | 6.7% |
| r | 1207808 | 6.6% |
| o | 1138492 | 6.3% |
| l | 1071612 | 5.9% |
| n | 1027785 | 5.7% |
| u | 997253 | 5.5% |
| t | 850651 | 4.7% |
| Other values (53) | 5214657 |
Common
| Value | Count | Frequency (%) |
| 1146792 | ||
| . | 147866 | 11.3% |
| - | 4567 | 0.3% |
| ( | 1408 | 0.1% |
| ) | 1408 | 0.1% |
| , | 1090 | 0.1% |
| ' | 972 | 0.1% |
| & | 510 | < 0.1% |
| 2 | 489 | < 0.1% |
| 0 | 361 | < 0.1% |
| Other values (15) | 866 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 19491528 | |
| None | 292 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 2343786 | 12.0% |
| i | 1854416 | 9.5% |
| e | 1259869 | 6.5% |
| s | 1219162 | 6.3% |
| r | 1207808 | 6.2% |
| 1146792 | 5.9% | |
| o | 1138492 | 5.8% |
| l | 1071612 | 5.5% |
| n | 1027785 | 5.3% |
| u | 997253 | 5.1% |
| Other values (66) | 6224553 |
None
| Value | Count | Frequency (%) |
| ë | 174 | |
| ü | 27 | 9.2% |
| ö | 25 | 8.6% |
| á | 23 | 7.9% |
| é | 12 | 4.1% |
| Á | 11 | 3.8% |
| ó | 7 | 2.4% |
| × | 6 | 2.1% |
| É | 4 | 1.4% |
| Ø | 1 | 0.3% |
| Other values (2) | 2 | 0.7% |
protocol
Text
Constant 
| Distinct | 1 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | EML |
|---|---|
| 2nd row | EML |
| 3rd row | EML |
| 4th row | EML |
| 5th row | EML |
| Value | Count | Frequency (%) |
| eml | 988399 |
Most occurring characters
| Value | Count | Frequency (%) |
| E | 988399 | |
| M | 988399 | |
| L | 988399 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 2965197 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| E | 988399 | |
| M | 988399 | |
| L | 988399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2965197 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| E | 988399 | |
| M | 988399 | |
| L | 988399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2965197 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| E | 988399 | |
| M | 988399 | |
| L | 988399 |
lastParsed
Text
| Distinct | 200353 |
|---|---|
| Distinct (%) | 20.3% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99574464 |
| Min length | 6 |
Unique
| Unique | 20728 ? |
|---|---|
| Unique (%) | 2.1% |
Sample
| 1st row | 2024-12-02T13:59:14.452Z |
|---|---|
| 2nd row | 2024-12-02T13:57:49.629Z |
| 3rd row | 2024-12-02T13:57:49.533Z |
| 4th row | 2024-12-02T13:59:17.370Z |
| 5th row | 2024-12-02T13:59:30.710Z |
| Value | Count | Frequency (%) |
| 2024-12-02t13:56:52.667z | 24 | < 0.1% |
| 2024-12-02t13:57:28.323z | 24 | < 0.1% |
| 2024-12-02t13:57:53.831z | 24 | < 0.1% |
| 2024-12-02t13:57:53.200z | 23 | < 0.1% |
| 2024-12-02t13:57:24.579z | 23 | < 0.1% |
| 2024-12-02t13:57:45.844z | 23 | < 0.1% |
| 2024-12-02t13:57:43.276z | 23 | < 0.1% |
| 2024-12-02t13:57:45.207z | 23 | < 0.1% |
| 2024-12-02t13:57:50.630z | 22 | < 0.1% |
| 2024-12-02t13:57:52.903z | 22 | < 0.1% |
| Other values (200343) | 988169 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4511296 | |
| 0 | 2508198 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 6.7% |
| 5 | 1570391 | 6.6% |
| 3 | 1563965 | 6.6% |
| T | 988399 | 4.2% |
| Z | 988399 | 4.2% |
| Other values (9) | 3549623 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16799642 | |
| Other Punctuation | 2964150 | 12.5% |
| Uppercase Letter | 1976799 | 8.3% |
| Dash Punctuation | 1976798 | 8.3% |
| Lowercase Letter | 5 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4511296 | |
| 0 | 2508198 | |
| 1 | 2493151 | |
| 4 | 1590376 | 9.5% |
| 5 | 1570391 | 9.3% |
| 3 | 1563965 | 9.3% |
| 7 | 759694 | 4.5% |
| 9 | 633972 | 3.8% |
| 6 | 594333 | 3.5% |
| 8 | 574266 | 3.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 988399 | |
| Z | 988399 | |
| A | 1 | < 0.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 3 | |
| o | 1 | 20.0% |
| a | 1 | 20.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1976798 | |
| . | 987352 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1976798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21740590 | |
| Latin | 1976804 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4511296 | |
| 0 | 2508198 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 7.3% |
| 5 | 1570391 | 7.2% |
| 3 | 1563965 | 7.2% |
| . | 987352 | 4.5% |
| 7 | 759694 | 3.5% |
| Other values (3) | 1802571 | 8.3% |
Latin
| Value | Count | Frequency (%) |
| T | 988399 | |
| Z | 988399 | |
| n | 3 | < 0.1% |
| A | 1 | < 0.1% |
| o | 1 | < 0.1% |
| a | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23717394 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4511296 | |
| 0 | 2508198 | |
| 1 | 2493151 | |
| - | 1976798 | |
| : | 1976798 | |
| 4 | 1590376 | 6.7% |
| 5 | 1570391 | 6.6% |
| 3 | 1563965 | 6.6% |
| T | 988399 | 4.2% |
| Z | 988399 | 4.2% |
| Other values (9) | 3549623 |
lastCrawled
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 24 |
| Mean length | 23.99998381 |
| Min length | 8 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 2024-12-02T11:48:23.416Z |
|---|---|
| 2nd row | 2024-12-02T11:48:23.416Z |
| 3rd row | 2024-12-02T11:48:23.416Z |
| 4th row | 2024-12-02T11:48:23.416Z |
| 5th row | 2024-12-02T11:48:23.416Z |
| Value | Count | Frequency (%) |
| 2024-12-02t11:48:23.416z | 988399 | |
| rollinia | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 2 | 4941995 | |
| 1 | 3953596 | |
| 4 | 2965197 | |
| 0 | 1976798 | 8.3% |
| - | 1976798 | 8.3% |
| : | 1976798 | 8.3% |
| Z | 988399 | 4.2% |
| 6 | 988399 | 4.2% |
| . | 988399 | 4.2% |
| 3 | 988399 | 4.2% |
| Other values (8) | 1976806 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 16802783 | |
| Other Punctuation | 2965197 | 12.5% |
| Uppercase Letter | 1976799 | 8.3% |
| Dash Punctuation | 1976798 | 8.3% |
| Lowercase Letter | 7 | < 0.1% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 4941995 | |
| 1 | 3953596 | |
| 4 | 2965197 | |
| 0 | 1976798 | 11.8% |
| 6 | 988399 | 5.9% |
| 3 | 988399 | 5.9% |
| 8 | 988399 | 5.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| l | 2 | |
| i | 2 | |
| o | 1 | |
| n | 1 | |
| a | 1 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Z | 988399 | |
| T | 988399 | |
| R | 1 | < 0.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 1976798 | |
| . | 988399 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 1976798 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 21744778 | |
| Latin | 1976806 | 8.3% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 2 | 4941995 | |
| 1 | 3953596 | |
| 4 | 2965197 | |
| 0 | 1976798 | 9.1% |
| - | 1976798 | 9.1% |
| : | 1976798 | 9.1% |
| 6 | 988399 | 4.5% |
| . | 988399 | 4.5% |
| 3 | 988399 | 4.5% |
| 8 | 988399 | 4.5% |
Latin
| Value | Count | Frequency (%) |
| Z | 988399 | |
| T | 988399 | |
| l | 2 | < 0.1% |
| i | 2 | < 0.1% |
| R | 1 | < 0.1% |
| o | 1 | < 0.1% |
| n | 1 | < 0.1% |
| a | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23721584 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2 | 4941995 | |
| 1 | 3953596 | |
| 4 | 2965197 | |
| 0 | 1976798 | 8.3% |
| - | 1976798 | 8.3% |
| : | 1976798 | 8.3% |
| Z | 988399 | 4.2% |
| 6 | 988399 | 4.2% |
| . | 988399 | 4.2% |
| 3 | 988399 | 4.2% |
| Other values (8) | 1976806 |
repatriated
Text
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9253 |
| Missing (%) | 0.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 4 |
| Mean length | 4.297422558 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | false |
|---|---|
| 2nd row | true |
| 3rd row | true |
| 4th row | true |
| 5th row | true |
| Value | Count | Frequency (%) |
| true | 687928 | |
| false | 291221 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 979149 | |
| t | 687928 | |
| r | 687928 | |
| u | 687928 | |
| f | 291221 | 6.9% |
| a | 291221 | 6.9% |
| l | 291221 | 6.9% |
| s | 291221 | 6.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4207817 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 979149 | |
| t | 687928 | |
| r | 687928 | |
| u | 687928 | |
| f | 291221 | 6.9% |
| a | 291221 | 6.9% |
| l | 291221 | 6.9% |
| s | 291221 | 6.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4207817 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 979149 | |
| t | 687928 | |
| r | 687928 | |
| u | 687928 | |
| f | 291221 | 6.9% |
| a | 291221 | 6.9% |
| l | 291221 | 6.9% |
| s | 291221 | 6.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4207817 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 979149 | |
| t | 687928 | |
| r | 687928 | |
| u | 687928 | |
| f | 291221 | 6.9% |
| a | 291221 | 6.9% |
| l | 291221 | 6.9% |
| s | 291221 | 6.9% |
projectId
Text
Constant  Missing 
| Distinct | 1 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 988401 |
| Missing (%) | > 99.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 6 |
| Min length | 6 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | edulis |
|---|
| Value | Count | Frequency (%) |
| edulis | 1 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1 | |
| d | 1 | |
| u | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1 | |
| d | 1 | |
| u | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 6 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1 | |
| d | 1 | |
| u | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 6 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1 | |
| d | 1 | |
| u | 1 | |
| l | 1 | |
| i | 1 | |
| s | 1 |
isSequenced
Text
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 5 |
|---|---|
| Median length | 5 |
| Mean length | 4.999925132 |
| Min length | 4 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | false |
|---|---|
| 2nd row | false |
| 3rd row | false |
| 4th row | false |
| 5th row | false |
| Value | Count | Frequency (%) |
| false | 988325 | |
| true | 74 | < 0.1% |
| acuta | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 988399 | |
| a | 988327 | |
| f | 988325 | |
| l | 988325 | |
| s | 988325 | |
| t | 75 | < 0.1% |
| u | 75 | < 0.1% |
| r | 74 | < 0.1% |
| c | 1 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4941926 |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 988399 | |
| a | 988327 | |
| f | 988325 | |
| l | 988325 | |
| s | 988325 | |
| t | 75 | < 0.1% |
| u | 75 | < 0.1% |
| r | 74 | < 0.1% |
| c | 1 | < 0.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 4941926 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 988399 | |
| a | 988327 | |
| f | 988325 | |
| l | 988325 | |
| s | 988325 | |
| t | 75 | < 0.1% |
| u | 75 | < 0.1% |
| r | 74 | < 0.1% |
| c | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4941926 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 988399 | |
| a | 988327 | |
| f | 988325 | |
| l | 988325 | |
| s | 988325 | |
| t | 75 | < 0.1% |
| u | 75 | < 0.1% |
| r | 74 | < 0.1% |
| c | 1 | < 0.1% |
gbifRegion
Text
Missing 
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 19586 |
| Missing (%) | 2.0% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 11.14384878 |
| Min length | 4 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | LATIN_AMERICA |
| 3rd row | LATIN_AMERICA |
| 4th row | LATIN_AMERICA |
| 5th row | AFRICA |
| Value | Count | Frequency (%) |
| latin_america | 416098 | |
| north_america | 317523 | |
| asia | 99994 | 10.3% |
| europe | 56004 | 5.8% |
| oceania | 44344 | 4.6% |
| africa | 33918 | 3.5% |
| antarctica | 935 | 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 2242657 | |
| I | 1328910 | |
| R | 1142001 | |
| E | 889973 | 8.2% |
| C | 813753 | 7.5% |
| N | 778900 | 7.2% |
| T | 735491 | 6.8% |
| _ | 733621 | 6.8% |
| M | 733621 | 6.8% |
| O | 417871 | 3.9% |
| Other values (6) | 979541 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 10062718 | |
| Connector Punctuation | 733621 | 6.8% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 2242657 | |
| I | 1328910 | |
| R | 1142001 | |
| E | 889973 | 8.8% |
| C | 813753 | 8.1% |
| N | 778900 | 7.7% |
| T | 735491 | 7.3% |
| M | 733621 | 7.3% |
| O | 417871 | 4.2% |
| L | 416098 | 4.1% |
| Other values (5) | 563443 | 5.6% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 733621 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 10062718 | |
| Common | 733621 | 6.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 2242657 | |
| I | 1328910 | |
| R | 1142001 | |
| E | 889973 | 8.8% |
| C | 813753 | 8.1% |
| N | 778900 | 7.7% |
| T | 735491 | 7.3% |
| M | 733621 | 7.3% |
| O | 417871 | 4.2% |
| L | 416098 | 4.1% |
| Other values (5) | 563443 | 5.6% |
Common
| Value | Count | Frequency (%) |
| _ | 733621 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 10796339 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 2242657 | |
| I | 1328910 | |
| R | 1142001 | |
| E | 889973 | 8.2% |
| C | 813753 | 7.5% |
| N | 778900 | 7.2% |
| T | 735491 | 6.8% |
| _ | 733621 | 6.8% |
| M | 733621 | 6.8% |
| O | 417871 | 3.9% |
| Other values (6) | 979541 |
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2 |
| Missing (%) | < 0.1% |
| Memory size | 7.5 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 13 |
| Mean length | 12.99999393 |
| Min length | 7 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NORTH_AMERICA |
|---|---|
| 2nd row | NORTH_AMERICA |
| 3rd row | NORTH_AMERICA |
| 4th row | NORTH_AMERICA |
| 5th row | NORTH_AMERICA |
| Value | Count | Frequency (%) |
| north_america | 988399 | |
| variety | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| R | 1976799 | |
| A | 1976799 | |
| T | 988400 | |
| E | 988400 | |
| I | 988400 | |
| N | 988399 | |
| O | 988399 | |
| H | 988399 | |
| _ | 988399 | |
| M | 988399 | |
| Other values (3) | 988401 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 11860795 | |
| Connector Punctuation | 988399 | 7.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 1976799 | |
| A | 1976799 | |
| T | 988400 | |
| E | 988400 | |
| I | 988400 | |
| N | 988399 | |
| O | 988399 | |
| H | 988399 | |
| M | 988399 | |
| C | 988399 | |
| Other values (2) | 2 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 988399 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11860795 | |
| Common | 988399 | 7.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| R | 1976799 | |
| A | 1976799 | |
| T | 988400 | |
| E | 988400 | |
| I | 988400 | |
| N | 988399 | |
| O | 988399 | |
| H | 988399 | |
| M | 988399 | |
| C | 988399 | |
| Other values (2) | 2 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| _ | 988399 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 12849194 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| R | 1976799 | |
| A | 1976799 | |
| T | 988400 | |
| E | 988400 | |
| I | 988400 | |
| N | 988399 | |
| O | 988399 | |
| H | 988399 | |
| _ | 988399 | |
| M | 988399 | |
| Other values (3) | 988401 |
level0Gid
Text
Missing 
| Distinct | 195 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 854767 |
| Missing (%) | 86.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | DOM |
|---|---|
| 2nd row | CHL |
| 3rd row | GUY |
| 4th row | USA |
| 5th row | BRA |
| Value | Count | Frequency (%) |
| usa | 23761 | |
| guy | 14629 | 10.9% |
| bra | 11793 | 8.8% |
| mex | 10743 | 8.0% |
| ven | 10534 | 7.9% |
| ecu | 6689 | 5.0% |
| guf | 4499 | 3.4% |
| bol | 4471 | 3.3% |
| per | 4388 | 3.3% |
| col | 3749 | 2.8% |
| Other values (185) | 38379 |
Most occurring characters
| Value | Count | Frequency (%) |
| U | 55471 | |
| A | 49231 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25143 | 6.3% |
| G | 24739 | 6.2% |
| N | 22898 | 5.7% |
| C | 20842 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (19) | 102364 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 400897 | |
| Decimal Number | 8 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 55471 | |
| A | 49231 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25143 | 6.3% |
| G | 24739 | 6.2% |
| N | 22898 | 5.7% |
| C | 20842 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (16) | 102356 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 7 | 3 | |
| 6 | 1 | 12.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 400897 | |
| Common | 8 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 55471 | |
| A | 49231 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25143 | 6.3% |
| G | 24739 | 6.2% |
| N | 22898 | 5.7% |
| C | 20842 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (16) | 102356 |
Common
| Value | Count | Frequency (%) |
| 0 | 4 | |
| 7 | 3 | |
| 6 | 1 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 400905 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| U | 55471 | |
| A | 49231 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25143 | 6.3% |
| G | 24739 | 6.2% |
| N | 22898 | 5.7% |
| C | 20842 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (19) | 102364 |
level0Name
Text
Missing 
| Distinct | 195 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 854767 |
| Missing (%) | 86.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 8.588745463 |
| Min length | 4 |
Unique
| Unique | 11 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | Dominican Republic |
|---|---|
| 2nd row | Chile |
| 3rd row | Guyana |
| 4th row | United States |
| 5th row | Brazil |
| Value | Count | Frequency (%) |
| united | 23809 | |
| states | 23778 | |
| guyana | 14629 | 8.4% |
| brazil | 11793 | 6.8% |
| méxico | 10743 | 6.2% |
| venezuela | 10534 | 6.1% |
| ecuador | 6689 | 3.9% |
| french | 5197 | 3.0% |
| guiana | 4499 | 2.6% |
| bolivia | 4471 | 2.6% |
| Other values (224) | 57483 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 163858 | |
| e | 105662 | 9.2% |
| i | 94961 | 8.3% |
| n | 84634 | 7.4% |
| t | 81902 | 7.1% |
| u | 56381 | 4.9% |
| r | 43581 | 3.8% |
| o | 42775 | 3.7% |
| 39990 | 3.5% | |
| l | 39687 | 3.5% |
| Other values (52) | 394326 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 933880 | |
| Uppercase Letter | 173438 | 15.1% |
| Space Separator | 39990 | 3.5% |
| Other Punctuation | 441 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 163858 | |
| e | 105662 | |
| i | 94961 | |
| n | 84634 | |
| t | 81902 | |
| u | 56381 | 6.0% |
| r | 43581 | 4.7% |
| o | 42775 | 4.6% |
| l | 39687 | 4.2% |
| d | 38966 | 4.2% |
| Other values (21) | 181473 |
Uppercase Letter
| Value | Count | Frequency (%) |
| S | 29799 | |
| U | 24119 | |
| G | 21942 | |
| B | 17299 | |
| M | 13763 | |
| C | 13726 | |
| V | 10908 | 6.3% |
| P | 10111 | 5.8% |
| E | 7473 | 4.3% |
| F | 5341 | 3.1% |
| Other values (14) | 18957 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 192 | |
| , | 168 | |
| ' | 81 |
Space Separator
| Value | Count | Frequency (%) |
| 39990 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1107318 | |
| Common | 40439 | 3.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 163858 | |
| e | 105662 | 9.5% |
| i | 94961 | 8.6% |
| n | 84634 | 7.6% |
| t | 81902 | 7.4% |
| u | 56381 | 5.1% |
| r | 43581 | 3.9% |
| o | 42775 | 3.9% |
| l | 39687 | 3.6% |
| d | 38966 | 3.5% |
| Other values (45) | 354911 |
Common
| Value | Count | Frequency (%) |
| 39990 | ||
| . | 192 | 0.5% |
| , | 168 | 0.4% |
| ' | 81 | 0.2% |
| - | 4 | < 0.1% |
| ( | 2 | < 0.1% |
| ) | 2 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1136834 | |
| None | 10923 | 1.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 163858 | |
| e | 105662 | 9.3% |
| i | 94961 | 8.4% |
| n | 84634 | 7.4% |
| t | 81902 | 7.2% |
| u | 56381 | 5.0% |
| r | 43581 | 3.8% |
| o | 42775 | 3.8% |
| 39990 | 3.5% | |
| l | 39687 | 3.5% |
| Other values (47) | 383403 |
None
| Value | Count | Frequency (%) |
| é | 10760 | |
| ô | 81 | 0.7% |
| ç | 68 | 0.6% |
| ã | 7 | 0.1% |
| í | 7 | 0.1% |
level1Gid
Text
Missing 
| Distinct | 1703 |
|---|---|
| Distinct (%) | 1.3% |
| Missing | 855021 |
| Missing (%) | 86.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 8 |
|---|---|
| Median length | 7 |
| Mean length | 7.447372564 |
| Min length | 6 |
Unique
| Unique | 338 ? |
|---|---|
| Unique (%) | 0.3% |
Sample
| 1st row | DOM.26_1 |
|---|---|
| 2nd row | CHL.6_1 |
| 3rd row | GUY.2_1 |
| 4th row | USA.47_1 |
| 5th row | BRA.1_1 |
| Value | Count | Frequency (%) |
| usa.21_1 | 4566 | 3.4% |
| guy.8_1 | 4001 | 3.0% |
| usa.47_1 | 3961 | 3.0% |
| guy.10_1 | 3952 | 3.0% |
| guy.2_1 | 3604 | 2.7% |
| ven.1_1 | 3448 | 2.6% |
| usa.9_1 | 3286 | 2.5% |
| ven.6_1 | 3215 | 2.4% |
| guf.1_1 | 2886 | 2.2% |
| usa.2_1 | 2677 | 2.0% |
| Other values (1693) | 97785 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1 | 178102 | |
| _ | 133380 | |
| . | 133323 | |
| U | 55403 | 5.6% |
| A | 48963 | 4.9% |
| 2 | 44340 | 4.5% |
| E | 33716 | 3.4% |
| S | 30480 | 3.1% |
| R | 25125 | 2.5% |
| G | 24740 | 2.5% |
| Other values (28) | 285766 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 400138 | |
| Decimal Number | 326497 | |
| Connector Punctuation | 133380 | 13.4% |
| Other Punctuation | 133323 | 13.4% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 55403 | |
| A | 48963 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25125 | 6.3% |
| G | 24740 | 6.2% |
| N | 22866 | 5.7% |
| C | 20750 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (16) | 102074 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 178102 | |
| 2 | 44340 | 13.6% |
| 4 | 18773 | 5.7% |
| 3 | 15053 | 4.6% |
| 9 | 12569 | 3.8% |
| 6 | 12395 | 3.8% |
| 5 | 12323 | 3.8% |
| 8 | 12169 | 3.7% |
| 0 | 11022 | 3.4% |
| 7 | 9751 | 3.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 133380 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 133323 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 593200 | |
| Latin | 400138 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 55403 | |
| A | 48963 | |
| E | 33716 | 8.4% |
| S | 30480 | 7.6% |
| R | 25125 | 6.3% |
| G | 24740 | 6.2% |
| N | 22866 | 5.7% |
| C | 20750 | 5.2% |
| B | 18493 | 4.6% |
| M | 17528 | 4.4% |
| Other values (16) | 102074 |
Common
| Value | Count | Frequency (%) |
| 1 | 178102 | |
| _ | 133380 | |
| . | 133323 | |
| 2 | 44340 | 7.5% |
| 4 | 18773 | 3.2% |
| 3 | 15053 | 2.5% |
| 9 | 12569 | 2.1% |
| 6 | 12395 | 2.1% |
| 5 | 12323 | 2.1% |
| 8 | 12169 | 2.1% |
| Other values (2) | 20773 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 993338 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1 | 178102 | |
| _ | 133380 | |
| . | 133323 | |
| U | 55403 | 5.6% |
| A | 48963 | 4.9% |
| 2 | 44340 | 4.5% |
| E | 33716 | 3.4% |
| S | 30480 | 3.1% |
| R | 25125 | 2.5% |
| G | 24740 | 2.5% |
| Other values (28) | 285766 |
level1Name
Text
Missing 
| Distinct | 1634 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 855020 |
| Missing (%) | 86.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 29 |
| Mean length | 10.18645694 |
| Min length | 3 |
Unique
| Unique | 323 ? |
|---|---|
| Unique (%) | 0.2% |
Sample
| 1st row | San Juan |
|---|---|
| 2nd row | Bío-Bío |
| 3rd row | Cuyuni-Mazaruni |
| 4th row | Virginia |
| 5th row | Acre |
| Value | Count | Frequency (%) |
| amazonas | 5737 | 3.2% |
| upper | 4920 | 2.7% |
| maryland | 4566 | 2.5% |
| essequibo | 4171 | 2.3% |
| potaro-siparuni | 4001 | 2.2% |
| virginia | 3991 | 2.2% |
| takutu-upper | 3952 | 2.2% |
| columbia | 3813 | 2.1% |
| cuyuni-mazaruni | 3604 | 2.0% |
| district | 3288 | 1.8% |
| Other values (1775) | 138660 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 199066 | |
| i | 109009 | 8.0% |
| r | 94704 | 7.0% |
| n | 89929 | 6.6% |
| o | 85891 | 6.3% |
| e | 69436 | 5.1% |
| u | 67791 | 5.0% |
| s | 50157 | 3.7% |
| t | 48416 | 3.6% |
| 47321 | 3.5% | |
| Other values (110) | 496970 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1094781 | |
| Uppercase Letter | 194205 | 14.3% |
| Space Separator | 47321 | 3.5% |
| Dash Punctuation | 21841 | 1.6% |
| Other Punctuation | 532 | < 0.1% |
| Modifier Symbol | 6 | < 0.1% |
| Close Punctuation | 2 | < 0.1% |
| Open Punctuation | 2 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 199066 | |
| i | 109009 | |
| r | 94704 | 8.7% |
| n | 89929 | 8.2% |
| o | 85891 | 7.8% |
| e | 69436 | 6.3% |
| u | 67791 | 6.2% |
| s | 50157 | 4.6% |
| t | 48416 | 4.4% |
| l | 43442 | 4.0% |
| Other values (68) | 236940 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 26555 | |
| M | 21196 | |
| S | 19522 | 10.1% |
| A | 15466 | 8.0% |
| P | 14898 | 7.7% |
| B | 12281 | 6.3% |
| T | 9818 | 5.1% |
| U | 9558 | 4.9% |
| D | 7694 | 4.0% |
| N | 7614 | 3.9% |
| Other values (22) | 49603 |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 208 | |
| ! | 144 | |
| . | 122 | |
| , | 57 | 10.7% |
| / | 1 | 0.2% |
Space Separator
| Value | Count | Frequency (%) |
| 47321 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 21841 |
Modifier Symbol
| Value | Count | Frequency (%) |
| ` | 6 |
Close Punctuation
| Value | Count | Frequency (%) |
| ] | 2 |
Open Punctuation
| Value | Count | Frequency (%) |
| [ | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1288986 | |
| Common | 69704 | 5.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 199066 | |
| i | 109009 | 8.5% |
| r | 94704 | 7.3% |
| n | 89929 | 7.0% |
| o | 85891 | 6.7% |
| e | 69436 | 5.4% |
| u | 67791 | 5.3% |
| s | 50157 | 3.9% |
| t | 48416 | 3.8% |
| l | 43442 | 3.4% |
| Other values (100) | 431145 |
Common
| Value | Count | Frequency (%) |
| 47321 | ||
| - | 21841 | |
| ' | 208 | 0.3% |
| ! | 144 | 0.2% |
| . | 122 | 0.2% |
| , | 57 | 0.1% |
| ` | 6 | < 0.1% |
| ] | 2 | < 0.1% |
| [ | 2 | < 0.1% |
| / | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1339717 | |
| None | 18880 | 1.4% |
| Latin Ext Additional | 93 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 199066 | |
| i | 109009 | 8.1% |
| r | 94704 | 7.1% |
| n | 89929 | 6.7% |
| o | 85891 | 6.4% |
| e | 69436 | 5.2% |
| u | 67791 | 5.1% |
| s | 50157 | 3.7% |
| t | 48416 | 3.6% |
| 47321 | 3.5% | |
| Other values (52) | 477997 |
None
| Value | Count | Frequency (%) |
| í | 6177 | |
| á | 4744 | |
| é | 3338 | |
| ó | 1150 | 6.1% |
| ã | 984 | 5.2% |
| Î | 684 | 3.6% |
| ô | 454 | 2.4% |
| ñ | 381 | 2.0% |
| Ñ | 252 | 1.3% |
| ö | 121 | 0.6% |
| Other values (37) | 595 | 3.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 25 | |
| ồ | 22 | |
| ắ | 18 | |
| ộ | 9 | 9.7% |
| ằ | 5 | 5.4% |
| ị | 4 | 4.3% |
| ả | 3 | 3.2% |
| ẵ | 3 | 3.2% |
| ệ | 2 | 2.2% |
| ế | 1 | 1.1% |
level2Gid
Text
Missing 
| Distinct | 7917 |
|---|---|
| Distinct (%) | 6.1% |
| Missing | 859029 |
| Missing (%) | 86.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 9.905212061 |
| Min length | 8 |
Unique
| Unique | 2525 ? |
|---|---|
| Unique (%) | 2.0% |
Sample
| 1st row | DOM.26.2_1 |
|---|---|
| 2nd row | CHL.6.3_1 |
| 3rd row | GUY.2.5_1 |
| 4th row | USA.47.8_1 |
| 5th row | BRA.1.11_2 |
| Value | Count | Frequency (%) |
| usa.9.1_1 | 3286 | 2.5% |
| guy.8.8_1 | 3032 | 2.3% |
| guy.2.8_1 | 2312 | 1.8% |
| guy.10.4_1 | 2189 | 1.7% |
| usa.21.15_1 | 1956 | 1.5% |
| usa.21.16_1 | 1386 | 1.1% |
| ven.6.5_1 | 1255 | 1.0% |
| ven.1.7_1 | 1231 | 1.0% |
| usa.47.102_1 | 1092 | 0.8% |
| usa.2.17_1 | 1081 | 0.8% |
| Other values (7907) | 110553 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 258687 | |
| 1 | 192666 | |
| _ | 129373 | 10.1% |
| 2 | 95635 | 7.5% |
| U | 55286 | 4.3% |
| A | 48244 | 3.8% |
| 4 | 37638 | 2.9% |
| E | 33642 | 2.6% |
| 3 | 32693 | 2.6% |
| S | 29992 | 2.3% |
| Other values (28) | 367611 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 505296 | |
| Uppercase Letter | 388111 | |
| Other Punctuation | 258687 | |
| Connector Punctuation | 129373 | 10.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| U | 55286 | |
| A | 48244 | |
| E | 33642 | 8.7% |
| S | 29992 | 7.7% |
| G | 24307 | 6.3% |
| R | 23607 | 6.1% |
| N | 22823 | 5.9% |
| C | 20618 | 5.3% |
| B | 17749 | 4.6% |
| M | 16726 | 4.3% |
| Other values (16) | 95117 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 192666 | |
| 2 | 95635 | |
| 4 | 37638 | 7.4% |
| 3 | 32693 | 6.5% |
| 5 | 29765 | 5.9% |
| 8 | 28434 | 5.6% |
| 6 | 25444 | 5.0% |
| 7 | 21723 | 4.3% |
| 9 | 21112 | 4.2% |
| 0 | 20186 | 4.0% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 258687 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 129373 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 893356 | |
| Latin | 388111 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| U | 55286 | |
| A | 48244 | |
| E | 33642 | 8.7% |
| S | 29992 | 7.7% |
| G | 24307 | 6.3% |
| R | 23607 | 6.1% |
| N | 22823 | 5.9% |
| C | 20618 | 5.3% |
| B | 17749 | 4.6% |
| M | 16726 | 4.3% |
| Other values (16) | 95117 |
Common
| Value | Count | Frequency (%) |
| . | 258687 | |
| 1 | 192666 | |
| _ | 129373 | |
| 2 | 95635 | 10.7% |
| 4 | 37638 | 4.2% |
| 3 | 32693 | 3.7% |
| 5 | 29765 | 3.3% |
| 8 | 28434 | 3.2% |
| 6 | 25444 | 2.8% |
| 7 | 21723 | 2.4% |
| Other values (2) | 41298 | 4.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1281467 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 258687 | |
| 1 | 192666 | |
| _ | 129373 | 10.1% |
| 2 | 95635 | 7.5% |
| U | 55286 | 4.3% |
| A | 48244 | 3.8% |
| 4 | 37638 | 2.9% |
| E | 33642 | 2.6% |
| 3 | 32693 | 2.6% |
| S | 29992 | 2.3% |
| Other values (28) | 367611 |
level2Name
Text
Missing 
| Distinct | 7281 |
|---|---|
| Distinct (%) | 5.6% |
| Missing | 859040 |
| Missing (%) | 86.9% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 27 |
| Mean length | 10.8988652 |
| Min length | 1 |
Unique
| Unique | 2224 ? |
|---|---|
| Unique (%) | 1.7% |
Sample
| 1st row | El Cercado |
|---|---|
| 2nd row | Concepción |
| 3rd row | Kamarang |
| 4th row | Arlington |
| 5th row | Manoel Urbano |
| Value | Count | Frequency (%) |
| of | 11716 | 5.2% |
| rest | 8168 | 3.6% |
| region | 8145 | 3.6% |
| 3557 | 1.6% | |
| de | 3551 | 1.6% |
| district | 3288 | 1.5% |
| columbia | 3288 | 1.5% |
| 8 | 3040 | 1.4% |
| san | 2745 | 1.2% |
| prince | 2492 | 1.1% |
| Other values (7516) | 174499 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 163519 | 11.6% |
| o | 112697 | 8.0% |
| e | 96344 | 6.8% |
| 95127 | 6.7% | |
| i | 93298 | 6.6% |
| n | 91747 | 6.5% |
| r | 79497 | 5.6% |
| t | 58541 | 4.2% |
| u | 49822 | 3.5% |
| l | 48733 | 3.5% |
| Other values (134) | 520574 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1091009 | |
| Uppercase Letter | 196524 | 13.9% |
| Space Separator | 95127 | 6.7% |
| Decimal Number | 10222 | 0.7% |
| Other Punctuation | 6418 | 0.5% |
| Dash Punctuation | 6284 | 0.4% |
| Open Punctuation | 2160 | 0.2% |
| Close Punctuation | 1175 | 0.1% |
| Math Symbol | 980 | 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 163519 | |
| o | 112697 | |
| e | 96344 | 8.8% |
| i | 93298 | 8.6% |
| n | 91747 | 8.4% |
| r | 79497 | 7.3% |
| t | 58541 | 5.4% |
| u | 49822 | 4.6% |
| l | 48733 | 4.5% |
| s | 44464 | 4.1% |
| Other values (70) | 252347 |
Uppercase Letter
| Value | Count | Frequency (%) |
| R | 24531 | |
| C | 21043 | 10.7% |
| S | 18194 | 9.3% |
| M | 15534 | 7.9% |
| A | 12868 | 6.5% |
| P | 12479 | 6.3% |
| B | 9745 | 5.0% |
| D | 8437 | 4.3% |
| N | 8307 | 4.2% |
| L | 8136 | 4.1% |
| Other values (34) | 57250 |
Decimal Number
| Value | Count | Frequency (%) |
| 8 | 3091 | |
| 7 | 2342 | |
| 9 | 2243 | |
| 1 | 1445 | |
| 0 | 689 | 6.7% |
| 6 | 126 | 1.2% |
| 2 | 102 | 1.0% |
| 3 | 96 | 0.9% |
| 5 | 67 | 0.7% |
| 4 | 21 | 0.2% |
Other Punctuation
| Value | Count | Frequency (%) |
| ' | 1961 | |
| . | 1603 | |
| / | 1439 | |
| , | 1370 | |
| ? | 45 | 0.7% |
Space Separator
| Value | Count | Frequency (%) |
| 95127 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 6284 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 2160 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1175 |
Math Symbol
| Value | Count | Frequency (%) |
| + | 980 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1287533 | |
| Common | 122366 | 8.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 163519 | 12.7% |
| o | 112697 | 8.8% |
| e | 96344 | 7.5% |
| i | 93298 | 7.2% |
| n | 91747 | 7.1% |
| r | 79497 | 6.2% |
| t | 58541 | 4.5% |
| u | 49822 | 3.9% |
| l | 48733 | 3.8% |
| s | 44464 | 3.5% |
| Other values (114) | 448871 |
Common
| Value | Count | Frequency (%) |
| 95127 | ||
| - | 6284 | 5.1% |
| 8 | 3091 | 2.5% |
| 7 | 2342 | 1.9% |
| 9 | 2243 | 1.8% |
| ( | 2160 | 1.8% |
| ' | 1961 | 1.6% |
| . | 1603 | 1.3% |
| 1 | 1445 | 1.2% |
| / | 1439 | 1.2% |
| Other values (10) | 4671 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1384947 | |
| None | 24804 | 1.8% |
| Latin Ext Additional | 148 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 163519 | 11.8% |
| o | 112697 | 8.1% |
| e | 96344 | 7.0% |
| 95127 | 6.9% | |
| i | 93298 | 6.7% |
| n | 91747 | 6.6% |
| r | 79497 | 5.7% |
| t | 58541 | 4.2% |
| u | 49822 | 3.6% |
| l | 48733 | 3.5% |
| Other values (62) | 495622 |
None
| Value | Count | Frequency (%) |
| í | 4840 | |
| á | 4536 | |
| é | 4424 | |
| ó | 3619 | |
| ã | 1773 | 7.1% |
| ñ | 1314 | 5.3% |
| ê | 791 | 3.2% |
| ü | 763 | 3.1% |
| ú | 647 | 2.6% |
| ç | 549 | 2.2% |
| Other values (47) | 1548 | 6.2% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ạ | 50 | |
| ả | 37 | |
| ữ | 14 | 9.5% |
| ồ | 11 | 7.4% |
| ủ | 11 | 7.4% |
| ợ | 10 | 6.8% |
| ị | 3 | 2.0% |
| ụ | 2 | 1.4% |
| ể | 2 | 1.4% |
| ậ | 2 | 1.4% |
| Other values (5) | 6 | 4.1% |
level3Gid
Text
Missing 
| Distinct | 4058 |
|---|---|
| Distinct (%) | 11.6% |
| Missing | 953538 |
| Missing (%) | 96.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 36 |
|---|---|
| Median length | 15 |
| Mean length | 11.74667279 |
| Min length | 11 |
Unique
| Unique | 1545 ? |
|---|---|
| Unique (%) | 4.4% |
Sample
| 1st row | CHL.6.3.12_1 |
|---|---|
| 2nd row | PER.18.1.3_1 |
| 3rd row | CRI.4.5.4_1 |
| 4th row | ECU.21.2.1_1 |
| 5th row | PER.8.9.1_1 |
| Value | Count | Frequency (%) |
| per.8.9.1_1 | 481 | 1.4% |
| per.18.3.4_1 | 344 | 1.0% |
| ecu.14.14.2_1 | 335 | 1.0% |
| bol.4.17.4_2 | 316 | 0.9% |
| can.6.1.8_1 | 291 | 0.8% |
| ecu.17.4.1_1 | 285 | 0.8% |
| bol.8.14.1_2 | 276 | 0.8% |
| can.13.1.35_1 | 214 | 0.6% |
| bol.4.18.2_2 | 207 | 0.6% |
| per.20.2.4_1 | 189 | 0.5% |
| Other values (4048) | 31926 |
Most occurring characters
| Value | Count | Frequency (%) |
| . | 104589 | |
| 1 | 65183 | |
| _ | 34863 | 8.5% |
| 2 | 26503 | 6.5% |
| C | 15795 | 3.9% |
| 4 | 15291 | 3.7% |
| 3 | 14173 | 3.5% |
| E | 12203 | 3.0% |
| 6 | 9734 | 2.4% |
| 5 | 9308 | 2.3% |
| Other values (32) | 101894 |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 165485 | |
| Other Punctuation | 104589 | |
| Uppercase Letter | 104581 | |
| Connector Punctuation | 34863 | 8.5% |
| Lowercase Letter | 14 | < 0.1% |
| Dash Punctuation | 4 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 15795 | |
| E | 12203 | |
| N | 8627 | 8.2% |
| A | 8156 | 7.8% |
| R | 7260 | 6.9% |
| U | 7217 | 6.9% |
| P | 6249 | 6.0% |
| L | 6230 | 6.0% |
| H | 5144 | 4.9% |
| B | 4792 | 4.6% |
| Other values (14) | 22908 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 65183 | |
| 2 | 26503 | |
| 4 | 15291 | 9.2% |
| 3 | 14173 | 8.6% |
| 6 | 9734 | 5.9% |
| 5 | 9308 | 5.6% |
| 8 | 8312 | 5.0% |
| 7 | 6139 | 3.7% |
| 9 | 6133 | 3.7% |
| 0 | 4709 | 2.8% |
Lowercase Letter
| Value | Count | Frequency (%) |
| c | 4 | |
| a | 4 | |
| b | 3 | |
| d | 2 | |
| e | 1 | 7.1% |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 104589 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 34863 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 4 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 304941 | |
| Latin | 104595 | 25.5% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| C | 15795 | |
| E | 12203 | |
| N | 8627 | 8.2% |
| A | 8156 | 7.8% |
| R | 7260 | 6.9% |
| U | 7217 | 6.9% |
| P | 6249 | 6.0% |
| L | 6230 | 6.0% |
| H | 5144 | 4.9% |
| B | 4792 | 4.6% |
| Other values (19) | 22922 |
Common
| Value | Count | Frequency (%) |
| . | 104589 | |
| 1 | 65183 | |
| _ | 34863 | 11.4% |
| 2 | 26503 | 8.7% |
| 4 | 15291 | 5.0% |
| 3 | 14173 | 4.6% |
| 6 | 9734 | 3.2% |
| 5 | 9308 | 3.1% |
| 8 | 8312 | 2.7% |
| 7 | 6139 | 2.0% |
| Other values (3) | 10846 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 409536 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| . | 104589 | |
| 1 | 65183 | |
| _ | 34863 | 8.5% |
| 2 | 26503 | 6.5% |
| C | 15795 | 3.9% |
| 4 | 15291 | 3.7% |
| 3 | 14173 | 3.5% |
| E | 12203 | 3.0% |
| 6 | 9734 | 2.4% |
| 5 | 9308 | 2.3% |
| Other values (32) | 101894 |
level3Name
Text
Missing 
| Distinct | 3831 |
|---|---|
| Distinct (%) | 11.1% |
| Missing | 953860 |
| Missing (%) | 96.5% |
| Memory size | 7.5 MiB |
Length
| Max length | 32 |
|---|---|
| Median length | 28 |
| Mean length | 10.60914249 |
| Min length | 2 |
Unique
| Unique | 1425 ? |
|---|---|
| Unique (%) | 4.1% |
Sample
| 1st row | Tomé |
|---|---|
| 2nd row | Manu |
| 3rd row | San José |
| 4th row | Alluriquin |
| 5th row | Echarate |
| Value | Count | Frequency (%) |
| san | 1730 | 3.1% |
| de | 1393 | 2.5% |
| unorganized | 1082 | 1.9% |
| la | 844 | 1.5% |
| el | 708 | 1.3% |
| no | 616 | 1.1% |
| division | 487 | 0.9% |
| echarate | 481 | 0.9% |
| santa | 470 | 0.8% |
| en | 449 | 0.8% |
| Other values (4214) | 48076 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 52267 | 14.3% |
| n | 25746 | 7.0% |
| o | 25629 | 7.0% |
| i | 23308 | 6.4% |
| 21794 | 5.9% | |
| e | 21436 | 5.8% |
| r | 18083 | 4.9% |
| u | 14350 | 3.9% |
| l | 13877 | 3.8% |
| t | 11765 | 3.2% |
| Other values (117) | 138206 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 281473 | |
| Uppercase Letter | 54853 | 15.0% |
| Space Separator | 21794 | 5.9% |
| Other Punctuation | 3254 | 0.9% |
| Decimal Number | 1728 | 0.5% |
| Open Punctuation | 1395 | 0.4% |
| Close Punctuation | 1091 | 0.3% |
| Dash Punctuation | 865 | 0.2% |
| Final Punctuation | 8 | < 0.1% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 52267 | |
| n | 25746 | 9.1% |
| o | 25629 | 9.1% |
| i | 23308 | 8.3% |
| e | 21436 | 7.6% |
| r | 18083 | 6.4% |
| u | 14350 | 5.1% |
| l | 13877 | 4.9% |
| t | 11765 | 4.2% |
| s | 8645 | 3.1% |
| Other values (63) | 66367 |
Uppercase Letter
| Value | Count | Frequency (%) |
| C | 5987 | 10.9% |
| S | 5956 | 10.9% |
| M | 3594 | 6.6% |
| P | 3581 | 6.5% |
| A | 3449 | 6.3% |
| B | 3173 | 5.8% |
| T | 3172 | 5.8% |
| L | 3070 | 5.6% |
| N | 2650 | 4.8% |
| D | 2344 | 4.3% |
| Other values (21) | 17877 |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 735 | |
| 2 | 253 | 14.6% |
| 4 | 166 | 9.6% |
| 0 | 127 | 7.3% |
| 3 | 119 | 6.9% |
| 9 | 111 | 6.4% |
| 8 | 65 | 3.8% |
| 6 | 55 | 3.2% |
| 5 | 50 | 2.9% |
| 7 | 47 | 2.7% |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 1547 | |
| . | 1501 | |
| ' | 135 | 4.1% |
| / | 40 | 1.2% |
| ! | 15 | 0.5% |
| : | 9 | 0.3% |
| " | 6 | 0.2% |
| ? | 1 | < 0.1% |
Space Separator
| Value | Count | Frequency (%) |
| 21794 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 1395 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 1091 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 865 |
Final Punctuation
| Value | Count | Frequency (%) |
| ’ | 8 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 336326 | |
| Common | 30135 | 8.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 52267 | |
| n | 25746 | 7.7% |
| o | 25629 | 7.6% |
| i | 23308 | 6.9% |
| e | 21436 | 6.4% |
| r | 18083 | 5.4% |
| u | 14350 | 4.3% |
| l | 13877 | 4.1% |
| t | 11765 | 3.5% |
| s | 8645 | 2.6% |
| Other values (94) | 121220 |
Common
| Value | Count | Frequency (%) |
| 21794 | ||
| , | 1547 | 5.1% |
| . | 1501 | 5.0% |
| ( | 1395 | 4.6% |
| ) | 1091 | 3.6% |
| - | 865 | 2.9% |
| 1 | 735 | 2.4% |
| 2 | 253 | 0.8% |
| 4 | 166 | 0.6% |
| ' | 135 | 0.4% |
| Other values (13) | 653 | 2.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 363392 | |
| None | 2910 | 0.8% |
| Latin Ext Additional | 151 | < 0.1% |
| Punctuation | 8 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 52267 | 14.4% |
| n | 25746 | 7.1% |
| o | 25629 | 7.1% |
| i | 23308 | 6.4% |
| 21794 | 6.0% | |
| e | 21436 | 5.9% |
| r | 18083 | 5.0% |
| u | 14350 | 3.9% |
| l | 13877 | 3.8% |
| t | 11765 | 3.2% |
| Other values (64) | 135137 |
None
| Value | Count | Frequency (%) |
| ñ | 638 | |
| é | 530 | |
| ó | 426 | |
| í | 303 | |
| á | 270 | |
| ê | 263 | |
| ü | 141 | 4.8% |
| ú | 60 | 2.1% |
| è | 58 | 2.0% |
| ơ | 35 | 1.2% |
| Other values (23) | 186 | 6.4% |
Latin Ext Additional
| Value | Count | Frequency (%) |
| ả | 28 | |
| ế | 28 | |
| ỷ | 17 | |
| ộ | 14 | |
| ữ | 12 | |
| ồ | 10 | 6.6% |
| ằ | 9 | 6.0% |
| ố | 8 | 5.3% |
| ạ | 6 | 4.0% |
| ờ | 4 | 2.6% |
| Other values (9) | 15 |
Punctuation
| Value | Count | Frequency (%) |
| ’ | 8 |
Missing 
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 91545 |
| Missing (%) | 9.3% |
| Memory size | 7.5 MiB |
Length
| Max length | 24 |
|---|---|
| Median length | 2 |
| Mean length | 2.00002453 |
| Min length | 2 |
Unique
| Unique | 1 ? |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | NE |
|---|---|
| 2nd row | NE |
| 3rd row | NE |
| 4th row | NE |
| 5th row | NE |
| Value | Count | Frequency (%) |
| ne | 712718 | |
| lc | 165443 | 18.4% |
| vu | 6108 | 0.7% |
| en | 4438 | 0.5% |
| nt | 3884 | 0.4% |
| dd | 2382 | 0.3% |
| cr | 1766 | 0.2% |
| ew | 91 | < 0.1% |
| ex | 26 | < 0.1% |
| 2024-12-02t13:56:28.527z | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| N | 721040 | |
| E | 717273 | |
| C | 167209 | 9.3% |
| L | 165443 | 9.2% |
| V | 6108 | 0.3% |
| U | 6108 | 0.3% |
| D | 4764 | 0.3% |
| T | 3885 | 0.2% |
| R | 1766 | 0.1% |
| W | 91 | < 0.1% |
| Other values (14) | 49 | < 0.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 1793714 | |
| Decimal Number | 17 | < 0.1% |
| Other Punctuation | 3 | < 0.1% |
| Dash Punctuation | 2 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 721040 | |
| E | 717273 | |
| C | 167209 | 9.3% |
| L | 165443 | 9.2% |
| V | 6108 | 0.3% |
| U | 6108 | 0.3% |
| D | 4764 | 0.3% |
| T | 3885 | 0.2% |
| R | 1766 | 0.1% |
| W | 91 | < 0.1% |
| Other values (2) | 27 | < 0.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 1 | 2 | 11.8% |
| 5 | 2 | 11.8% |
| 0 | 2 | 11.8% |
| 3 | 1 | 5.9% |
| 4 | 1 | 5.9% |
| 6 | 1 | 5.9% |
| 8 | 1 | 5.9% |
| 7 | 1 | 5.9% |
Other Punctuation
| Value | Count | Frequency (%) |
| : | 2 | |
| . | 1 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1793714 | |
| Common | 22 | < 0.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| N | 721040 | |
| E | 717273 | |
| C | 167209 | 9.3% |
| L | 165443 | 9.2% |
| V | 6108 | 0.3% |
| U | 6108 | 0.3% |
| D | 4764 | 0.3% |
| T | 3885 | 0.2% |
| R | 1766 | 0.1% |
| W | 91 | < 0.1% |
| Other values (2) | 27 | < 0.1% |
Common
| Value | Count | Frequency (%) |
| 2 | 6 | |
| 1 | 2 | 9.1% |
| 5 | 2 | 9.1% |
| : | 2 | 9.1% |
| 0 | 2 | 9.1% |
| - | 2 | 9.1% |
| 3 | 1 | 4.5% |
| 4 | 1 | 4.5% |
| 6 | 1 | 4.5% |
| 8 | 1 | 4.5% |
| Other values (2) | 2 | 9.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1793736 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| N | 721040 | |
| E | 717273 | |
| C | 167209 | 9.3% |
| L | 165443 | 9.2% |
| V | 6108 | 0.3% |
| U | 6108 | 0.3% |
| D | 4764 | 0.3% |
| T | 3885 | 0.2% |
| R | 1766 | 0.1% |
| W | 91 | < 0.1% |
| Other values (14) | 49 | < 0.1% |